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TTFP ATms C VIRUS rHCVi POLYPEPTIDES 



10 Technical Field 

The invention relates to materials and methodologies for managing 
the spread of hq>atitis C vims (HCV) infection. More specifically, it relates to 
polypeptides useful as immunological reagents in the detection, prevention and 
treatment of HCV infections. 

15 

Papkgypund 

HCV was first identified and characterized as a cause of non-A, 
non-B hepatitis (NANBH) by Houghton et al. This led to the disclosure of a 
number of general and specific polypeptides useful as immunological reagents. 

20 See, e.g., Houghton et al., EPO Pub. No. 318,216; Houghton et al., EPO Pub. 
No. 388,232; Choo et al„ Science (1989) 244:359-362; Kuo et aL, Science (1989) 
244:362-364; Houghton et al., Hepatologv fl99n 14:381-388. These 
publications provide the ait with an extensive background on HCV generally, as 
well as the manufacture and uses of HCV polypeptide immunological reagents. 

25 For brevity, therefore, the disclosure of these publications in particular are 
incoi]porated herein by reference. 

Others have readily applied and extended the work of Houghton et 
al. See, e.g., Highfield et al., UK Pat. App. 2,239,245 (The Wellcome 
Foundation Ltd.); Wang, EPO Pub. No. 442,394 (United Biomedical Inc.); Leung 

30 et al., EPO Pub. No. 445,423 (Abbott Laboratories); Habits et al., EPO Pub. No. 
451,891 (Akzo N.V.); Reyes et al., PCT Pub. No. WO 91/15516 (Genelabs Inc.); 
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Maki et aL, EPO Pub. No. 468,657 (Tonen Corp.); and Kamada et al., EPO Pub. 
No. 469,348 (Shionogi Seiyaku K.K.). 

Sensitive, specific methods for screening and identifying carriers of 
HCV and HCV-contaminated blood or blood products are an important advance in 
5 medicine. Post-transfusion hepatitis (FTS) occurs in s^proximately 10% of 
transfused patients, and HCV has accounted for up to 90% of these cases. The 
major problem in this disease is the frequent progression to chronic liver damage 
(25-55 %). Patient care as well as the prevention of transmission of HCV by 
blood and blood products or by close personal contact require reliable diagnostic 

10 and prognostic tools, such as HCV polypeptides, to detect antibodies related to 
HCV, Such polypeptides are also useful as vaccines and immunotherapeutic 
therapeutic agents for the prevention and/or treatment of the disease. 

Since HCV is a relatively new agent, a continuing need CTSts to 
define additional immunological reagents that will aUow furtfier study of tiie 

15 clinical course of disease and the epidemiology of HCV in the population. 



Disclosure of the Invention 

The invention pertains to the characterization of new HCV q)itopes. 
The characterization of these epitopes permits the manufacture of polypeptide 

20 products which reacted hmnunologically with antibodies to HCV and/or generate 
anti-HCV antibody production in vivo. These polypeptide products are useful as 
standards or reagents in diagnostic tests and/or as components of vaccmes. Anti- 
bodies, including for example botii polyclonal and monoclonal, directed against 
HCV epitopes contained within these polypeptide sequences are also useful 

25 reagents, for exanqjle, m diagnostic tests, as therapeutic agents, for screening of 
antiviral agents, and for tiie isolation of isolation/purification of HCV polypq)tides 
or particles. 

In its broadest sense, the present invention is directed to 
polypeptides containing the newly characterized HCV epitopes disclosed herem, 
30 methods of marrafacturing such polypeptides (e.g., recombinant and synthetic 
mefliods), methods of using such polypq)tides (e.g., diagnostic, vaccine, and 
therapeutic), and articles of manufacture, compositions or formulations adapted to 
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such uses (e.g., polypq)tides fixed to an immunoassay or otlier support, oral or 
injectable pharmaceutical compositions). Similarly, antibodies (polycl nal, 
monoclonal, or equivalents such as binding fragments, single-chain antigen-binding 
proteins, etc.) to the HCV q)itopes disclosed h^ein are also included within the 
5 scope of the present invention, as well as methods of making such antibodies, 
methods of using such antibodies (e.g., diagnostic, vaccine, and therapeutic), and 
articles of manufacture, compositions or formulations ndaptad to such uses (e.g., 
antibodies fixed to an immunoassay or other sugppoit, oral or injectable 
pharmaceutical compositions). 

10 Other a^}ects of the invention pertain to kits for analyzing samples 

for the presence of an HCV antigen comprising the above antibodies in a suitable 
container. Still other aspects of the invention pertain to kits for analyzing samples 
for the presence of an antibodies directed against an HCV antigen comprising a 
polypeptide as described above in a suitable container. 

IS Still other aspects of the invention are: a method for producing a 

polypeptide containing the newly disclosed HCV q>itopes comprising incubating 
host cells transformed with an e?q)ression vector contaimng a sequence encoding a 
polypeptide contaming the HCV epitope und^ conditions which allow expression 
of said polypeptide; and a polypeptide contaimng such an HCV qiitope produced 

20 by this method. 

Immunoassays are also included in the invention. These include an 
immunoassay for detecting an HCV antigen comprising incubating a sample 
suspected of containing an HCV antigen with an antibody as described above 
under conditions which allow the formation of an antigen-antibody complex; and 

25 detecting an antigen-antibody complex containing the antibody. An inununoassay 
for detecting anti-HCV antibodies comprising incubating a sanq}Ie suspected of 
containing anti-HCV antibodies with a polypq)tide as described above, under 
conditions which allow the formation of an antibody-antigen complex; and 
detecting the antibody-antigen complex containing the polypeptide. 

30 Also included in the invention are vaccines for treatment of HCV 

infection comprising an immunogenic peptide containing an HCV epitope 
described herein. 
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Yet another aspect of the inventioii is a m^od for producing 
antibodiBS to HCV comprising administaing to an individual an isolated 
immunogenic polypeptide containing an HCV epitope described herein in an 
amnnnt suffideot to pioduce an immune response. 
5 The above aspects of the present invention are accomplished by the 

discovery of HCV epitopes of the fonnula 

aa,-aay 

wherein aa denotes an amino add; 
X and y are integers such that y-x ^ 6; 
10 aax-aa^ indicates a portion of the amino add sequence of Figure 1; 

and 

X is selected fitom the group consisting of 23-34, 36, 66-79, 81-94, 
96-98, 101-103, 186-189, 191, 206, 223, 232, 256, 286, 297-299, 321, 347, 357, 
413, 414, 432, 465-471, 480484, 501, 502, 521, 540-549, 579, 594-599, 601- 

15 613, 641, 662-665, 685, 705, 706, 729, 782-789, 801, 851-855, 893, 916, 928, 
946, 952-954, 1026, 1072, 1109, 1112-1117, 1218, 1240, 1280-1285, 1322, 1338, 
1371, 1384, 1410, 1411, 1454, 1492, 1493, 1532-1535, 1560, 1561, 1566-1568, 
1571-1577, 1601-1607, 1615-1620, 1655, 1695, 1710-1712, 1728, 1729, 1758- 
1762, 1781, 1808, 1821, 1851, 1880, 1908-1913, 1925, 1940-1948, 1951, 1966- 

20 1969, 1999, 2001-2004, 2006-2014, 2024, 2048-2053, 2055-2057, 2071, 2088- 
2093, 2108, 2122-2148, 2165, 2187, 2226-2232, 2244-2249, 2267, 2281-2286, 
2288, 2289, 2325-2327, 2346, 2347, 2349, 2382, 2401, 2417-2422, 2439-2444, 
2446-2456, 2469, 2471-2476, 2495, 2533, 2534, 2573-2578, 2602-2604, 2606- 
2612, 2632-2638, 2660, 2676-2679, 2688-2693, 2707, 2721, 2757-2762, 2779, 

25 2794, 2795, 2797-2799, 2801, 2802, 2817-2843, 2863-2867, 2878-2884, 2886- 
2895. 
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The above objects aie also achieved using HCV q>itopes of the 

fonnula 

aa^-aay 

wh^in aa denotes an amino acid; 
5 X and y are integers such that y-x ^ 6; 

aa^-aa^ indicates a portion of the amino add sequence of Figure 1; 
and X is selected from the group consisting of 35 (where y is less than 

45), 80 (where y is less than 90), 95 (wh^e y is less than 110), 99 (where y is 
less than 120), 100 (where y is less than 150), 190 (where y is less than 210), 500 

10 (where y is less than 550), 600 (where y is less than 625), 1260 (where y is less 
than 1280), 1569 (where y is less than 1931), 1570 (where y is less than 1590), 
1694 (where y is less than 1735), 1949 (where y is less than 2124), 1950 (where y 
is less than 1985), 2000 (where y is less than 2050), 2005 (where y is less than 
2025), 2054 (where y is less than 2223), 2250 (where y is less than 2330), 2287 

15 (where y is less than 2385), 2290 (where y is less than 2310), 2345 (where y is 
less than 2375), 2348 (where y is less than 2464), 2445 (where y is less than 
2475), 2470 (where y is less than 2490), 2605 (where y is less than 2620), 2780 
(where y is less than 2830), 2796 (where y is less than 2886), 2800 (where y is 
less than 2850), and 2885 (where y is less than 2905). 

20 In either of the above fonnula, x-y can less than or equal to 10, 20, 

30, 40 or 50 in some embodiments of the invention. 

Brief Description of the Drawings 

Fig. 1 shows the polyprotein of the HCV prototype isolate HCVl. 
25 Fig. 2 shows a composite cDNA sequence for HCVl. 

Fig. 3 shows the nucleotide consensus sequence of human isolate 
23, variant sequences are shown below the sequence line. The amino acids 
encoded in the consensus sequence are also shown. 

Fig. 4 shows the nucleotide consensus sequence of human isolate 
30 27, variant sequences are shown below the sequence line. The amino adds 
encoded in the consensus sequence are also shown. 
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Fig. 5 shows the aligned nucleotide sequences of human isolates 23 
and 27 and of HCVl. Homologous sequences are indicated by the symbol (*). 
Non-homologous sequences aie in small letters. 

Fig. 6 shows the aligned amino add sequences of human isolates 23 
5 and 27 and of HCVl. Homologous sequences are indicated by the symbol (*). 
Non-homologous sequences aie in small letters. 

Fig. 7 shows a comparison of the com|x>site aligned nucleotide 
sequences of isolates Thorn, ECl, HCT #18, and HCVl. 

Fig. 8 shows a comparison of the nucleotide sequences of EClO and 
10 a composite of the HCVl sequence; the EClO sequence is on the line above the 
dots, and the HCVl sequence is on the line below the dots. 

Kg. 9 shows a comparison of the amino add sequences 117-308 
(relative to HCVl) encoded in the "EnvL" regions of the consensus sequences of 
human isolates HCT #18, JH23, JH 27, Thome, ECl, and of HCVl. 
15 Fig. 10 shows a conq}arison of the amino add sequences 330-360 

(relative to HCVl) encoded m the "EnvR" regions of the consensus sequences of 
human isolates HCT #18, JH23, JH 27, Thome, ECl, and of HCVl. 

Modes for Carrying Out the Invention 

20 

Complete dtations to publications refened to herein can be found in 
the "Background" or "Bibliography" sections. 
L Definitions 

"Hepatitis C vims" or "HCV" refers to the art-recognized viral 
25 spedes of which pathogenic strains cause NANBH, and attenuated strains or 

defective interfering particles derived therefrom. See generally, publications dted 
in the section entitled "Background." The HCV genome is comprised of RNA. It 
is known that RNA containing viruses have relatively high rates of spontaneous 
mutation, Le,, lepoitedly on the order of 10*^ to 10^ per incorporated nucleotide 
30 (Fields & Kiiipe (1986)). Therefore, smce heterogeneity and fluidity of geno^rpe 
arc inherent in RNA viruses, there are multiple strains/isolates, which may be 
virulent or avirulent, within the HCV spedes. The propagation, identification. 
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detection^ and isolation of the various HCV strains or isolates has been well 
documented in the literature. Moreover, the disclosure hereta allows the 
preparation of diagnostics and vaccines for the various strains/isolates, as well as 
compositions and methods that have utility in screening procedures for anti-viral 
5 agents for pharmacologic use, such as agents that inhibit replication of HCV. 

Information on several diffisreot strains/isolates of HCV is disclosed 
herein, particulariy strain or isolate CDC/HCVl (also called HCVl). Mormation 
from one strain or isolate, such as a partial genomic or amino acid sequence, is 
sufficient to allow those skilled in the art using standard techniques to isolate new 
10 strains/isolates and to identify whether such new strains/isolates are HCV. For 
example, several different strains/isolates are described below. These strains, 
which were obtained from a rmmber of human sent (and from different 
geographical areas), were isolated utilizing the information from the genomic 
sequence of HCVl, 

15 The information provided herein is indicative that HCV may be 

distantiy related to the flaviviridae. The Flavivirus femily contains a large number 
of viruses which are small, enveloped pathogens of man. The morphology and 
composition of Flavivirus particles are known, and are discussed in Brinton 
(1986). Generally, with respect to morphology, Flaviviruses contain a central 

20 nucleocapsid surrounded by a lipid bilayer. Virions are spherical and have a 

diameter of about 40-50 imi. Their cores are about 25-30 nm in diameter. Along 
the outer surface of the virion envelcpe are projections that are about 5-10 nm 
long with terminal knobs about 2 nm in diam^r. Typical eTcamples of the family 
include Yellow Fever virus. West Nile virus, and Dengue Fever virus. They 

25 possess positive-stranded RNA genomes (~ 11,000 nucleotides) that are slightiy 
larger than that of HCV and encode a polyprotein precursor of about 3500 amino 
acids. Individual viral proteins are cleaved from this precursor polypeptide. 

The genomic structure and the nucleotide sequence of HCV genomic 
RNA has t>een deduced. The genome appears to be single-stranded SNA con- 

30 taining - 10,000 nucleotides. Hie genome is positive-stranded, and possesses a 
continuous, translational open reading frame (OSF) that encodes a polyprotein of 
about 3,000 amino adds. In the ORF, the stmctural protein(s) appear to be 



SUBSTITUTE SHEET 



wo 93/00365 



PCr/US92/05388 



10 



encoded ia approxiniatBly the first qoaiter of the N-tenninns region, with the 
majority of the polypiotem responsible for non-structural proteins. When 
compared with all known viial sequences, smafl but significant co-linear 
homologies are observed with the non-structural proteins of die flavivinis family, 
and with the pestiviruses (which are now also considered to be part of the 

Flavivirus family). 

Based upon the putative amino adds encoded in the nucleotide 
sequence of HCVl and other evidence, possfljle protein domains of the encoded 
HCV polyprotein, as wefl as the approximate boundaries, are the fbllowing: 



Pntative Domain 



A pproximat e Boundary 
faminn acid nos.> 



15 



C (nudeocapsid protein) 
El (virion envelope protein) 



1-191 
192-383 



20 



25 



Ej/NSl (aivdope?) 
NS2 (unknown function) 
NS3 (protease?) 
NS4 (unknown function) 
NS5 (polymerase) 



384-800 

800-1050 

1050-1650 

1651-2100 

2100-3011(end!) 
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These domains axe, however, tentative. For example, the E1-NS2 border is 
probably in the 750-810 r^on, and NS3-NS4 border is about 1640-1650. There 
is also evidrace that the 191 aa version of C is a precursor that is further 
5 processed (e.g., to about 170 aa m length), and that the NS2, NS4 and NS5 
proteins are each further processed into two mature proteins. 

Different strains, isolates or subtypes of HCV are ejqjected to 
contain variations at the amino add and nucleic adds compared with HCVl. 
Many isolates are expected to show much (i.e., more than about 40%) homology 

10 m the total amino add sequoice compared with HCVl . However, it may also be 
found that there are other less homologous HCV isolates. These would be defined 
as HCV according to various criteria such as, for example, an OKF of 
proximately 9,0(X) nucleotides to approximately 12,000 nucleotides, encoding a 
polyprotein similar in size to that of HCVl, an encoded polyprotein of similar 

15 hydrophobic and/or antigenic character to that of HCVl, and the presence of co- 
linear peptide sequences that are conserved with HCVl. In addition, the genome 
would be a positive-stranded RNA. 

HCV encodes at least one epitope which is immunologically 
identifiable with an epitope in the HCVl polyprotein. The epitope is unique to 

20 HCV when compared to previously known Flaviviruses. The uniqueness of the 
q)itope may be determined by its immunological reactivity with anti-HCV anti- 
bodies and lack of immunological reactivity with antibodies to known Flavivirus 
spedes. Methods for determining immunological reactivity are known in the art, 
for example, by radioimmunoassay, by ELIS A assay, by hemagglutination, and 

25 several examples of suitable techniques for assays are provided herein. 

Alternatively, a comparison of the sequence of the HCV epitope to previously 
known sequences of members of the Flavivirus family can be used to evaluate 
"uniqueness." 

In addition to the above, the following parameters of nudeic add 
30 homology and amino acid homology are applicable, either alone or in 

combination, in identifying a strain/isolate as HCV. Since HCV strains and 
isolates are evolutionarily related, it is e?q>ected that the overall homology of the 
genomes at the nucleotide level may be about 10% or greater, probably will be 
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about 4096 or greater, probably about 60% or greater, and even more probably 
about 80% or greater; and in addition tiiat diere will be corresponding contiguous 
sequences of at least about 13 nucleotides. It should be noted that variable and 
hypervariable regions within the HCV genome; therefore, the homology in these 
5 legrons is aqwcted to be significantly less than that in the overall genome. The 

correspondence between the putative HCV strain gaiomic sequence and, for 
example, the CDC/HCVl cDNA sequence can be determined by techniques known 
inthearL For example, they can be detramined by a direct comparison of tfie 
sequence information of the polynucleotide from the putative HCV, and the HCV 

0 cDNA sequMce(s) described herein. For example, also, they can be determined 
by Itybridization of the polynucleotides under conditions which fonn stable 
duplexes between homologous r^ons (for exanq)le, those which would be used 
prior to Si digestion), followed by digestion with single stranded specific 
nuclease(s), followed by size determination of the digested firagments. 

5 Because of the evolutionary relationshq) of the strains or isolates of 

HCV, putative HCV strains or isolates are identifiable by their homology at the 
polypeptide level. Generally, HCV strains or isolates are eqwcted to be at least 
10% homologous, more than about 40% homologous, probably more than about 
70% homologous, and even more probably more than about 80% homologous, and 

[) some may even be more than about 90% homologous at the polypeptide level. 
The techniques for determining amino acid sequence homology are known in the 
art. For example, the amino acid sequence may be determined directly and 
compared to die sequences provided herem. Alternatively the nucleotide sequence 
of the genomic material of the putative HCV may be determined (usually via a 

) cDNA intermediate), the amino add sequence encoded therem can be determined, 
and the corresponding regions compared. 

As used herem, a polynucleotide "derived from" a designated 
sequrace refers to a polynucleotide sequence which is comprised of a sequence of 
sqjproximately at least about 6 nucleotides, preferably at least about 8 nucleotides, 

3 more preferably at least about 10-12 nucleotides, and even more preferably at least 
about 15-20 nucleotides conespondiag to a region of the designated nucleotide 
sequence. "Corresponding" means homologous to or complementary to the 
designated sequence. Preferably, the sequence of the region from which the 
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polynucleotide is derived is homologous to or compl^entaiy to a sequence which 
is unique to an HCV genome. Whether or not a sequrace is unique to the HCV 
genome can be determined by techniques known to those of skill in the art. For 
exanq)Ie, the sequence can be compared to sequences in databanks (as of the 
5 priority date), e.g, , Genebank, to d^rmine whedier it is present in the uninfected 
host or other organisms. The sequence can also be compared to the known (as of 
the priority date) sequences of other viral agmts, mcluding those which are known 
to induce hq>atitis, e.g. , HAV, HBV, and HDV, and to members of the 
Flaviviridae. The correspondence or non-correspondence of the derived sequence 

10 to other sequences can also be detennined by hybridization under the appropriate 
stringency conditions. Hybridization techniques for determining the 
complementarity of nucleic add sequences are known in the ait. See, for 
exan^le, Maniatis ei al (1982). In addition, mismatches of diq>lex poly- 
nucleotides fonned by hybridization can be detennined by known techniques, 

15 includmg for example, digestion with a nuclease such as SI that specifically 
digests single-stranded areas in duplex polynucleotides. Regions from which 
typical DNA sequences may be "derived" include but are not limited to, for 
example, regions encoding specific qntopes, as weU as non-transcribed and/or 
non-translated regions. 

20 Tbe derived polynucleotide is not necessarily physically derived 

from the nucleotide sequence shown, but may be generated in any manner, 
including for example, chemical synthesis or DNA rq)lication or reverse 
transcription or transcription. In addition, combinations of regions corresponding 
to that of the designated sequence may be modified in ways known in the art to be 

25 consistent with an intended use. 

Similarly, a polypeptide or amino acid sequence "derived from" a 
designated amino acid or nucleic acid sequence refers to a polypeptide having an 
amino acid sequence identical to that of a polypeptide encoded in the sequence, or 
a portion thereof wherein the portion consists of at least 3-5 amino acids, and 

30 more preferably at least 8-10 amino acids, and even more preferably at least 11-15 
amino acids, or which is immunologically identifiable with a polypeptide encoded 
in the sequence. This terminology also includes a polypeptide expressed from a 
designated nucleic acid sequence. 
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A recombinant or derived polypqrtide is not necessarily translated 
fixnn a designated nucleic add sequence; it may be generated in any manner, 
including for exaniple, cliemical syntiiesis, or expression of a recombinant 
eaqpression system, or isolation from HCV, including mutated HCV. A lecombin- 
5 ant or derived polypeptide may include one or more analogs of amino adds or 
unnatural amino acids in its sequaice. Methods of inserting analogs of amino 
adds into a sequence are known in the art. tt also may include one or more 
labels, wliich are known to tliose of skill in the art. 

The term "recombinant polynndeotide" as used herein intends a 
10 polynndeotide of genomic, cDNA, semisynthetic, or synthetic origin which, by 
virtue of its origin or manipilation: (1) is not associated with all or a portion of a 
polynucleotide with which it is associated in nature, (?) is linked to a 
polynucleotide other than that to which it is linked in nature, or (3) does not occur 
in nature. 

15 The tenn "polynucleotide" as used herein refers to a polymeric form 

of nucleotides of any length, either ribonucleotides or deoxyribonucleotides. Hiis 
tenn refers only to the primary stracmre of the molecule. Thus, this term 
includes double- and single-stianded DNA and RNA. It also mchides known types 
of modifications, for example, labels which are known in the art, methylation, 

20 "c^s", substitution of one or more of the naturafly occurring nucleotides with an 
analog, intemncleotide modifications such as, far example, those wifli uncharged 
linkages ie.g., methyl phosphonates, phosphotriestCTS, phosphoramidates, 
carbamates, etc.) and with charged linkages {e.g., phosphorothioates, phosphoro- 
dithioates, etc.), those containing pendant moieties, such as, for example proteins 

25 (including for e.g. , iracleases, toxins, antibodies, signal peptides, poly-L-lysine, 
etc.), those with intercalators (e.g., acridine, psoralen, etc.), those containmg 
dielalors (e.g., metals, radioactive metals, boron, oxidative metals, etc.), those 
containing alkylators, those with modified linkages (e.g. , alpha anomeric nucleic 
acids, etc.), as well as unmodified forms of the polynndeotide. 

30 A "purified" polypeptide refers to the polypeptide being in a state 

that is substantially free of other polypeptides, f.e. , in a composition that contains 
a minimum of about 50% by weight (desired polypeptide/total polyp^tide in 
composition), prderably a mmimum of about 70%, and even more preferably a 
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mifiimiifn of about 90% of the desixed polypeptide, without regard to non- 
proteinaceous materials in the composition. Techniques for purifying viral poly- 
peptides are known in the ait. Purified antibodies are similarly defined. 

"Recombinant host cells", "host cells", "cells", "cell lines", "cell 
S cultures" , and other such terms denoting microorganisms or higher eukaiyotic cell 
lines cultured as unicellular ^titles refer to cells which can be, or have been, used 
as r6C^lie^ts for recombinant vector or other transfer DNA, and include the 
progeny of the original cell which has been transfected. It is understood that the 
progeny of a single parental cell may not necessarily be completely identical in 

10 moq)hology or in genomic or total DNA complement as the original parent, due to 
natural, accidental, or deliberate mutation. 

A "replicon" is any genetic element, e.^., a plasmid, a 
chromosome, a virus, a cosmid, etc. that behaves as an autonomous unit of 
polynucleotide replication within a cell; Le. , equable of replication under its own 

IS control. 

A "vector" is a replicon in which another polynucleotide segment is 
attached, so as to bring about the replication and/or expression of the attached 
segment. 

"Control sequence" refers to polynucleotide sequences which are 
20 necessary to effect the e}q}ression of coding sequences to which they sie ligated. 
The nature of sach control sequences differs dqiending upon the host organism; in 
prokaiyotes, such control sequences generaUy include promoter, ribosomal binding 
site, and temiinators; in eukaryotes, generally, such control sequences include 
promoters, terminators arKl, in some instances, mbancers. The term "control 
25 sequences" is intraded to include, at a minimum, all components whose presence 
is necessary for expression, and may also include additional components whose 
presence is advantageous, for example, leader sequences. 

"Opeiably linked" refers to a juxtaposition wherein the components 
so described are in a relationship permitting them to function in their intended 
30 maimer. A control sequence "operably linked" to a coding sequence is ligated in 
such a way that e?q)ression of the coding sequence is achieved under conditions 
compatible with the control sequences. 
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An "open reading fiame" (OEF) is a i^on of a polynucleotide 
sequence wMch encodes a polypeptide; tins region may lepresent a portion of a 
coding sequence or a total coding sequence. 

A "coding sequence" is a polynucleotide sequence which is 
5 transcribed into mRNA and/or translated into a polypeptide when placed under tiie 
control of ^ropiiate regulatory sequences. The boundaries of tiie coding 
sequence are detennined by a translation start codon at tiie 5'-tenninus and a 
translation stop codon at tiie 3'-terminus. A coding sequence can include, but is 
not limited to mRNA, cDNA, and recombinant polynucleotide sequences. 

10 -Lnmunologically identifiable wifli/as" refers to die presence of 

epitope(s) and polypq)tides(s) wMch are also present in tiie designated 
polypeptide(s), usuafly HCV proteins. Immunological identity may be determined 
by antibody binding and/or competition in binding; tiiese techniques are known to 
those of average skill in the art 

15 As used herein, "epittipe" refers to an antigenic determinant of a 

polypeptide. An epitope could comprise 3 or more amino acids tfiat defines tiie 
binding site of an antibody. Generally an epitope consists of at least 5 amino 
acids, and sometimes consists of at least 8 amino adds. Metfiods of q)itope 
nuqpping are known in tiie art. 

20 A polypeptide is "immunologically reactive" witii an antibody when 

it hinds to an antibody due to antibody recognition of a specific epitope contained 
witiiin tiie polypeptide. Immunological reactivity may be determined by antibody 
binding, more particularly by tiie kinetics of antibody binding, and/or by competi- 
tion in binding using as conq)etitor(s) a known polypeptide(s) containing an epitope 

25 agamst which tiie antibody is directed. The techniques for determming wheflier a 
polypeptide is immunologically reactive witii an antibody are known in tiie art. 

As used herein, the term "antibody" refers to a polypqitide or group 
of polypeptides which are comprised of at least one antibody combining site. An 
"antibody combining site" or "binding domain" is formed from tiie folding of vari- 

30 able domains of an antibody molecule(s) to fonn tiiree-dimensional binding spaces 
with an internal surface shape and charge distribution complementary to tiie 
features of an epitope of an antigen, which aUows an immunological reaction witfi 
the antigen. An antibody combining site may be formed from a heavy and/or a 
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light chain domain (Vh and Vl, respectively), which fonn hypervaiiable loops 
which contribute to antigen binding. Hie term ''antibody" includes, for example, 
vertebrate antibodies, hybrid antibodies, chimeric antibodies, altered antibodies, 
univalent antibodies, the Fab proteins, and single domain antibodies. 
S As used herein, a "smgle domain antibody" (dAb) is an antibody 

which is comprised of an VH domain, which reacts immunologically with a 
designated antigen. A dAb does not contain a V^, domain, but may contain other 
antigen binding domains known to ^dst in antibodies, for example, the kappa and 
lambda domains. Methods for prefming dAbs are known in the art. See, for 
10 example. Ward et oL (1989). 

Antibodies may also be comprised of Vh and V^ domains, as well as 
other known antigen binding domains. Examples of these types of antibodies and 
methods for their prq>aiadon arc known in the art (see, e.g. , U.S. Patent No. 
4,816,467, which is incorporated herem by reference), and include the following. 
15 For example, "vertebrate antibodies" refers to antibodies which are tetiamers or 
aggregates thereof, comprising light and heavy chains which are usually 
aggregated in a configuration and which may or may not have covalent link- 
ages between the chains. In vertebrate antibodies, the amino add sequences of all 
the chains of a particular antibody are homologous with the chains found in one 
20 antibody produced by the lymphocyte which produces that antibody in situ, or in 
vitro (for example, in hybridomas). V^tebrate antibodies typically include native 
antibodies, for example, purified polyclonal antibodies and monoclonal antibodies. 
Examples of the m^ods for the preparation of these antibodies are described 
infra. 

25 "Hybrid antibodies" are antibodies wherein one pair of heavy and 

light chains is homologous to those in a iint antibody, while the other pair of 
heavy and light chains is homologous to those in a different second antibody. 
Typically, each of these two pairs will hind different epitopes, particulariy on 
different antigens. This results in the property of "divalence", i.e., the ability to 

30 bind two antigens simultaneously. Such hybrids may also be formed using 
chimeric chains, as set forth below. 

"Chimeric antibodies", are antibodies in which the heavy and/or 
light chains are fusion proteins. Typically the constant domain of the chains is 
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from one particular species and/or class, and the vadable domains are from a 
different species and/or class. Also included is any antibody in which either or 
both of the heavy or light chains are composed of combinations of sequences 
mimicking the sequences in antibodies of different sources, whether these sources 
5 be differing classes, or different species of origin, and whether or not the fiision 
pomt is at the variable/constant boundary. Thus, it is possible to produce 
antibodies in which neither the constant nor the variable region mimic known anti- 
body sequences. It then becomes possible, for example, to construct antibodies 
whose variable region has a higher specific afSnity for a particular antigen, or 
10 whose constant region can elicit enhanced complement fixation, or to make other 
improvements in properties possessed by a particular constant region. 

Another example is "altered antibodies", which refers to antibodies 
in which the naturafly occurring amino add sequence in a vertebrate antibody has 
been varied. Utilizing recombinant DNA techniques, antibodies can be redesigned 
15 to obtain desired characteristics. The possible variations are many, and range 
from the changing of one or more amino adds to the complete redesign of a 
region, for exaniple, the constant region. Changes in the constant region, in 
general, to attain desked ceflular process characteristics, e.g., changes in com- 
plement fixation, interaction with membranes, and other effector functions. 
20 Changes in the variable region may be made to alter antigen binding 

characteristics. The antibody may also be engineered to aid the specific delivCTy 
of a molecule or substance to a specific cell or tissue site. The desired alterations 
may be made by known techniques in molecular biology, e.^., recombinant 
techniques, site directed mutagenesis, etc, 
25 Yet another example are "univalent antibodies", which are 

aggregates comprised of a heavy chain/light chain dimer bound to the Fc (z.e. , 
constant) region of a second heavy chain. This type of antibody esc25)es antigenic 
modulation. See, e.g., Glennieef ai. (1982). 

Induded also within the definition of antibodies are "Fab" fragments 
30 of antibodies. The "Fab" region refers to those portions of the heavy and light 
chains which are roughly equivalent, or analogous, to the sequences which 
comprise the branch portion of the heavy and light chains, and which have been 
shown to exhibit immunological binding to a specified antigen, but which lack the 
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effector Fc portian. "Fab" includes aggregates of one heavy and one light chain 
(commonly known as Fab'), as well as tetramers containing the 2H and 2L chains 
(referred to as F(ab)2), which are capable of selectively reacting with a designated 
antigm or antigen family. "Fab" antibodies may be divided into subsets analogous 
5 to those described above, i.e, "vertebrate Fab", "hybrid Fab", "chimeric Fab", 
and "altered Fab". Methods of producing "Fab" fragments of antibodies are 
known within the ait and include, for example, proteolysis, and synthesis by 
recombinant techniques. 

Also included in the term "antibodies" are single-chain antigen- 
10 binding (SCA) proteins, such as the type described in the article co-authored by 
Schlom, J. in the June 15, 1992 issue of Cancer Research (as well as articles cited 
therein). 

As used herein, the term "inmiunogenic polypeptide" is a 
polypqitide that elicits a cellular and/or humoral immune response, whether alone 

15 or linked to a carrier in the presence or absence of an adjuvant. 

The term "polypeptide" refers to a polymer of amino adds and does 
not refer to a specific length of the product; thus, pq)tides, oligopeptides, and 
proteins are included within the definition of polypq>tide. This term also does not 
refer to or exclude post-expression modifications of the polypeptide, for example, 

20 glycosylation, acetylarion, phosphorylation and the like. Included within the 
definition are, for example, polypeptides containing one or more analogs of an 
amino acid (including, for example, unnatural amino adds, etc.), polypeptides 
with substituted linkages, as well as other modifications known in the art, both 
naturally occurring and non-naturally occurring. 

25 "Transformation", as used herein, refers to the insertion of an 

exogenous polynudeotide into a host cell, irrespective of the method used for the 
insertion, for example, direct uptake, transduction, f-mating or dectroporation. 
The exogenous polynucleotide may be maintained as a non-integrated vector, for 
example, a plasmid, or alternatively, may be integrated into the host genome. 

30 "Treatment" as used herein refers to prophylaxis and/or therapy. 

An "individual", as used herein, refers to vertebrates, particularly 
members of the manmialian spedes, and includes but is not limited to animals 
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(e.g-, dogs, cats, cattle, swine, sheep, goat, rabbits, mice, rats, guinea pigs, etc.), 
and piimates, including monkeys, chimps, baboons and humans. 

As used herein, the "sense stiand" of a mideic acid contains the 
sequence that has sequence homology to that of mRNA. The "anti-sense stiand" 
5 contains a sequrace which is complementary to that of the "sense strand". 

As used herein, a "positive stranded genome" of a virus is one in 
which the genome, whedier UNA or DNA, is single-stranded and which encodes a 
viral polypeptide(s), Exanqiles of positive stranded RNA viruses mclude 
Togaviridae, Coionaviridae, Retroviridae, Picomaviridae, and Caliciviridae. 
10 Included also, are the FlaviviridaB, which were formerly classified as Togaviradae. 
See Fields & Knipe (1986). 

As used herein, "antibody containing body component" refers to a 
component of an individual's body which is a source of the antibodies of interest. 
Antibody containing body componmts are known in the art, and include but are 
15 not limited to, for example, plasma, serum, spinal fluid, lyn5)h fluid, the external 
sections of the respiratory, intestinal, and genitourinary tracts, tears, saliva, milk, 
white blood cells, and myelomas. 

As used herein, a "biological sample" refers to a sample of tissue or 
fluid isolated from an individual, including but not limited to, for exanqjle, 
20 plasma, serum, spinal fluid, lymph fluid, the external sections of the skin, respir- 
atory, intestinal, and gCTitourinary tracts, tears, saliva, milk, blood cells, tumors, 
organs, and also samples of in vitro cell culture constituents (including but not 
limited to conditioned medium resulting from the growth of cells in cell culture 
medium, putatively vixally infected cells, recombinant cells, and cell components). 

25 

n. Description of the Invention 

The practice of the present invention wiH employ, unless otherwise 
indicated, conventional techniques of molecular biology, microbiology, 
recombinant DNA, and immunology, which are within the skill of the art. Such 
30 techniques are explsmod fully in the literature. See e.g. , Maniatis, Fitsch & 

Sambrook, "Molecular Cloning; A Laboratory Manual" (1982); "DNA Cloning, 
Volumes I and II" (D.N Glover ed. 1985); "Oligonucleotide Synthesis" (M.J. Gait 
ed, 1984); "Nucleic Add Hybridization" (B.D. Hames & SJ. Higgins eds. 1984); 
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"Transcnptioii and Tnmslation'' (B.D. Hames & SJ. Biggins eds. 1984); 
"Animal Cell Culture" (R.L Freshney ed. 1986); Immobilized Cells And 
Enzymes" (IRL Press, 1986); B. Peibal, "A Practical Guide To Molecular 
Cloning" (1984); the series, "Methods in Bizymology" (Academic Press, Inc.); 
5 "Gene Transfer Vectors For Mammalian Cells" (J.H. ^filler and M.P. Calos eds. 
1987, Cold Spring Harbor Laboratoiy), Meth Enrymnl VoL 154 and Vol. 155 
(Wu and Grossman, and Wu, eds., respectively), Mayer and Walker, eds. (1987), 
"Immunochemical Methods In Cell And Molecular Biology" (Academic Press, 
London); Scopes, (1987) "Protein Purification: Principles and Practice", Second 
10 Edition (Springer-Veriag, N.Y.); and "Handbook of E)qjerimental Immunology", 
Volumes I-IV (D.M. Weir and C, C. Blackwell eds 1986). All patents, patent 
applications, and publications mentioned herein, both supra and infra, are hereby 
incorporated herein by reference. 

15 n.A. Truncated HCV Polypeptides 

The useiul materials and processes of the present invention are 
made possible by the identification below of new HCV epitopes. The knowledge 
of these epitopes (or antigenic regions) allows for construcdon of polypeptides 
containing truncated HCV sequences winch can be used as immunological 

20 reagrats. 

Truncated HCV amino add sequences encoding at least one viral 
q>itope are useful immunological reagents. For example, polypeptides comprising 
such truncated sequences can be used as reagents in an immimoassay. These 
polypeptides also are candidate subunit antigens in compositions for antiserum 

25 production or vaccines. While these truncated sequences can be produced by 

various known treatments of native viral protein, it is generally preferred to make 
synthetic or recombinant polypeptides comprising an HCV sequence. Polypeptides 
comprising these truncated HCV sequences can be made up entirely of HCV 
sequences (one or more epitopes, either contiguous or noncontiguous), or HCV 

30 sequences and heterologous sequences in a fusion protein. Useful heterologous 
sequences include sequences that provide for secretion from a recombinant host, 
enhance the immunological reactivity of the HCV epitope(s), or facilitate the 
coupling of the polypeptide to an immunoassay support or a vaccine carrier. See, 
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eg., EPO Pub. No. 116.201; U.S. Pat. No. 4.722,840; EPO Pub. No. 259.149; 
U.S. PaL No. 4,629.783, the disdosuies of which are incoipoiated hraein by 
lefia^ce. 

Hie size of pdypqrtides comprising the tnmcated HCV sequences 
5 can vaiy widdy, the minimum si2B being a sequence of sufficient size to provide 
an HCV qiitope, while the maximum size is not criticaL For convenience, the 
maximum size usuaUy is not substantiany greater than that required to provide the 
desired HCV qatopes and function(s) of the heterologous sequence, if any. 
Typically, the tnmcated HCV amino add sequoice will range from about 5 (or 8) 

10 to about 100 amino adds in length. More typically, however, the HCV sequence 
win be a maximnm of about 50 (or 40) amino adds in length, and sometimes a 
maximum of about 20, 25 or 30 amino adds. It is usually desirable to sdect 
HCV sequences of at least about 8, 10, 12 or 15 amino adds. 

Exanqiles of truncated HCV amino add sequences (octamers) that 

15 are useful as described herem are set forth bdow in the exan^iles. tt is to be 

understood that these peptides do not necessarily precisely map one epitope. Non- 
immunogenic portions of the sequence can be defined using conventional 
techniques and deleted from die described sequences. Further, additional trun- 
cated HCV amino add sequmces that comprise an epitope or are immunogenic 

20 can be identified as desraibed herein. 

Polypeptide products containing the tnmcated HCV amino acid 
sequences disclosed below can be prepared as discrete peptides or incoiporated 
into a larger polypqptide, and may find use as descanbed herein. Li prefOTed 
applications, truncated sequences from tiie El and/or E2 domains have aRpUcations 

25 in vacdne and tiierq)eutic products. While gaierally any of the domains can have 
some diagnostic utility, C, NS3, NS4 and NS5 are particulariy preferred, wifli 
combinations of C epitopes with q)itapes from one or more of tiie NS3, NS4 or 
NS5 domains being particulariy preferred. 

30 ILB. Preparation n f Polypeptides 

The availability of DNA sequences encoding HCV amino acid 
sequences permits tiie construction of expression vectors encoding antigenically ac- 
tive regions of tiie poIypq)tide (See, e.g., Fig. 2). These antigenically active 
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regions may be derived from coat or envelope antigens or from core antigens^ or 
from antigens wbich are non-structural including, for example, polynucleotide 
binding proteins, polynucleotide polymerase(s), and other viral proteins required 
for the replication and/or assembly of the virus particle. Fragments encoding the 
5 desired poiypqptides are derived, for exampl , from viral cDNA clones using 
conventional restriction digestion or by synthetic methods, and are ligated into 
vectors which may, for example, contain portions of fusion sequmces such as 
i?-Galactosidase or superoxide dismutase (SOD), preferably SOD. Methods and 
vectors which are useful for the production of polypeptides which contain fusion 

10 sequences of SOD are described in European Patent Office Publication numb^ 
0196QS6, published October 1, 1986. Vectors encoding fusion polypeptides of 
SOD and HCV polypeptides, i.e., NANB3.H, NANBgi, and ClOO-3, which is 
encoded in a composite of HCV cDNAs, are described in Sections IV.B.l, 
IV.B.2, and IV.B.4, respectively. Any desired portion of the HCV cDNA 

15 containing an open reading frame (or a synthetic version thereof), can be used to 
express a recombinant polypq)tide, such as a mature or fusion protein. 

Alternatively, a polypeptide contain HCV epitopes can be provided 
by chemical synthesis using standard techniques based on the amino acid sequence 
of the figures and the examples. 

20 The DNA encoding the desired polypeptide, whether in fused or 

unfiised fonn, and whether or not containing a signal sequrace to permit secretion, 
may be ligated into e^qnession vectors suitable for any convenient host. Both 
eukaiyotic and prokaryotic host systems are presmtly used in forming recombinant 
polypeptides, and host cell lines is given in BPO Pub. Nos. 318,216. The 

25 polypeptide is then isolated from lysed cells or from the culture medium and pur- 
ified to the extent needed for its intended use. Purification may be by techniques 
known in the art, for example, differratial extraction, salt fractionation, 
chromatogrq)hy on ion exchange resins, affinity chromatogr^hy, centrifiigation, 
and the like. See, for example, Methods in Enzymology for a variety of methods 

30 for purifying proteins. Such polypeptides can be used as diagnostics, or those 
which give rise to neutraliziag antibodies may be formulated into vaccines. Anti- 
bodies raised against these polypeptides can also be used as diagnostics, or for 
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passive immunotheiapy. In addition, as discussed below, antibodies to these 
polypeptides are useful for example in isolating and identifying HCV paitides. 

The HCV polypq)tides may also be isolated from HCV virions and 
truncated ^not already). The virions may be grown in HCV infected cells in 
5 tissue cultoxe, or in an infected host. 

ILC. Prgparatiog of Antigenic Poly peprides and roniiiTOtTon with Carrigf 
An antigenic region of a polypq)tide is generally relatively 
small-typically 8 toIO amino adds or less in length. Fragments of as few as 5 

10 amino adds may characterize an antigenic region. These segments may cor- 
respond to regions of HCV antigen. Accordingly, using the cDNAs of HCV as a 
basis, DNAs encoding short segments of HCV polypeptides can be expressed 
recombinanfly either as fusion proteins, or as isolated polypeptides. In addition, 
short amino add sequences can be convenientty obtained by chemical synthesis. 

15 111 instances wherein the synthesized polypeptide is conecfly configured so as to 
provide the correct epitope, but is too small to be immunogenic, the polypq>tide 
may be linked to a suitable carrier. 

A number of techniques for obtaining such linkage are known in the 
art, including the formation of disulfide linkages using N-sucdnimidyl-3-(2- 

20 pyridyIfliio)prppionate (SPDP) and sucdnimidyl 

4-(N-maleimido-methyl)cyclohexane-l-carboxyIate(SMCQ obtained from Pierce 
Company, Rockford, Illinois, (if the peptide lacks a sulfhydryl group, tiiis can be 
provided by addition of a cystdne residue.) These reagraits create a disulfide 
linkage between tiiMnselves and peptide qrsteine residues on one protein and an 

25 amide linkage tinougji tiie epsilon-amino on a lysine, or other free amino group in 
the other. A variety of such disulfide/amide-forming agents are known. See, for 
example, Tminiin Rev (1982) ^:185. Ctther bifimctional coupling agents form a 
tiiioether rather than a disulfide linkage. Many of fliese tiiio-ether-fonning agents 
are commerdally available and include reactive esters of 6-maleinudocaproic acid, 

30 2-bromoacetic acid, 2-iodoacetic acid, 

4-(N-maleimido-metiiyl)cyclohexane-l-carboxylic acid, and tiie like. The carbo^l 
groups can be activated by combining tiiem witii succinimide or l-hydroxyl-2 
mtix>-4-suIfonic add, sodium salt. Additional metiiods of coupling antigens 
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employs the tDtavinis/''famdtng pqjtide" system described in EPO Pub» No. 
259,149, the disclosure of which is incorporated herein by reference. The 
forcing list is not meant to be exhaustive, and modifications of the named 
compounds can cleady be used. 
5 Any carrier may be used which does not itself induce the production 

of antibodies harmful to the host Suitable carriers are typically large, slowly 
m^abolized macromolecules such as proteins; polysaccharides, such as latex 
iunctionalized Sepharose®, agarose, cellulose, cellulose beads and the like; 
polymeric amino acids, such as polyglutamic add, polylysine, and the like; amino 
10 add cqx>lymers; and inactive virus particles, see, for example. Section n.D. 
Especially useful protein substrates axe serum albumins, keyhole limpet 
hemocyanin, inununoglobulin molecules, thyroglobulin, ovalbumin, tetanus toxoid, 
and other proteins well known to those skilled in the art. 

In addition to fiill-length viral protems, polypq)tides comprising 

15 

n.D. Preparation of Hvbrid Particle Immunopens ContainiP B nrv lipUopes 

The immunogenidty of the epitopes of HCV may also be ^ihanced 
by preparing them in mammalian or yeast systems fused with or assembled with 
particle-forming proteins such as, for example, that associated with hepatitis B 

20 surface antigen. See, e.^., US 4,722,840. Constmcts wherein the HCV epitope 
is linked directly to the particle-forming protein coding sequences produce hybrids 
which are inununogenic with respect to the HCV qntope. In addition, all of the 
vectors prepared indude q)itqpes specific to HBV, having various degrees of 
immunogenidty, such as, for example, the pre-S pq)tide. Thus, particles 

25 constructed from particle forming protein which include HCV sequences are im- 
munog^c with respect to HCV and HBV. 

Hq>atitis surface antigen (HBSAg) has been shown to be formed and 
assembled into particles in S. cerevisiae (P. Valenzuela et al. (1982)), as well as 
in, for example, mammalian cells (P. Valenzuela et al. (1984)). The formation of 

30 such particles has been shown to enhance the immunogenicity of the monomer 

subunit. The constructs may also include the immunodominant epitope of HBSAg, 
comprising the 55 amino adds of the presurface (pre-S) region. Neurath et al 
(1984). Constructs of the pre-S-HBSAg particle expressible in yeast are disclosed 



wo 93/00365 



-24- 



PCr/US92/0S388 



in EPO 174,444, published March 19, 1986; hybrids indoding heterologous viral 
sequences for yeast equession arc disclosed in EPO 175,261, published Maich 26, 
1966. Hiese constructs may also be expnased in mamm a li a n cells such as 
Chinese hamster ovary (CHO) cells usmg an SV40-dihydrofolate reductase vector 
5 (NficheUe et aL (1984)). 

In addition, poitioDS of flie particle-forming proton codmg sequence 
may be replaced with codons encoding an HCVqjitope. lii diis replacement, 
regions which arc not required to mediate the aggregation of the units to form 
immunogenic particles in yeast or mammals can be deleted, thus eliminating 
10 adrfirintial HBV antigenic sites from cmnpetition with the HCV qritope. 

n.E. Preparation of Vaccines 

Vaccines may be prepared from one or more immunogenic 
polypeptides derived ftom HCV. These polypqrtides may be ejcpiessed in 

15 various host cells {e.g. , bacteria, yeast, insect, or m ammalian cells), or 
alternatively may be isolated from viral preparations or made synthetically. 
Single- or multi-valent vaccines against HCV may be comprised of one or more 
epitopes from one or more structural ptotems, and/or one or more qpitqpes from 
one or mrae nonstructural proteins. These vaccines may be comprised of, for 

20 example, recombinant HCV polypeptides and/or polypqrtides isolated from the 
virions. In particular, vaccmes are contemplated conqirising one or more of the 
following HCV proteins, or subunit arrtigens derived therefrom: El, E2, C, NS2, 
NS3, NS4 and NS5. Particularly preferred arc vaccines comprising El and/or E2, 
or subunits thereof. 

25 In addition to the above, it is also possible to prepare live vacdnes 

of attenuated microorganisms wMdi e:q>ress one or more recomhmant HCV 
polypq)tides. Suitable attenuated microorganisms are known in the art and 
include, for example, viruses (e.g., vaccinia virus (see Brown et al. (1986)), as 
well as bacteria. 

30 The prqjaralion of vaccmes which contain an immunogenic 

polypeptide(s) as active ingredients, is known to one skilled in the art. Tjfpically, 
such vaccines are prepared as injectables, either as liquid solutions or suspensions; 
solid forms suitable for solution in, or suspension in, liquid prior to mjection may 
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also be prqsared. The piqiaration may also be emulsified, or the protein 
e!ic2q>sulated in liposomes. Tlie active immunogenic ingredients are often mixed 
with excipients which aie phaxmacsutically acceptable and compatible with the 
active ingiedi^ Suitable excq)ients are, for example, water, saline, dextrose, 
5 glycerol, ethanol, or the like and combinati ns thereof. In addition, if desired, the 
vaccine may contain nunor amounts of auxiliaiy substances such as wetting or 
emulsifying agents, pH buffering agents, and/or adjuvants which enhance the 
effecdveness of the vaccine. Examples of adjuvants which may be effective 
include but aie not limited to: ahmdinnm hydroxide, N-acetyl-muramyl-L-threonyl- 

10 D-isoglutamine (thr-MDP), N-acetyl-nor-muramyl-L-alanyl-D-isogiutamine (CGP 
11637, referred to as nor-MDP), N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L- 
alanine-2-(l ''2'-d5)aImitoyl-sn-glycero-3-hydroxyphosphoiyloxy)-ethylamine (CGP 
19835A, refened to as MTP-PE), and RIBI, which contains three components 
extracted from bacteria, monophosphoiyl Iqnd A, trehalose dimycolate and cell 

15 wall skeleton OMPL+TDM-f CWS) in a 2% squalene/Tween® 80 emulsion. The 
effectiveness of an adjuvant may be detennined by measuring the amount of 
antibodies directed against an immunogenic polypeptide containing an HC V 
antigenic sequence resulting from administration of this polypeptide in vaccines 
which are also coniprised of the various adjuvants. 

20 Hie vaccines are conventionally administered parenterally, by injec- 

tion, for example, either subcutaneously or intramuscularly. Additional 
formulations which are suitable for other modes of administration include 
suiq)ositories arui, in some cases, oral formulations. For suppositories, traditional 
bmders and carriers may include, for example, polyalkylene glycols or 

25 triglycerides; such siq)positories may be formed from mixtures containing the ac- 
tive ingredient in the range of 0.5% to 10%, preferably 1 %-2%. Oral formu- 
lations include such normally employed excipients as, for example, pharmaceutical 
grades of marmitol, lactose, starch, magnesium stearate, sodium saccharine, 
cellulose, magnesium carbonate, and the like. These compositions take the form 

30 of solutions, suspensions, tablets, pills, capsules, sustained release formulations or 
powders and contain 10%-95% of active ingredient, preferably 25%-70%. 

The proteios may be formulated into the vaccine as neutral or salt 
forms. PharmaceuticaUy acceptable salts include the acid addition salts (formed 
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wifli ftee amino groups of the pqrtide) and which are fonned with inoiganic adds 
such as, for example, hydrochloric or phosphoric acids, or such organic adds such 
as acetic, oxalic, tartaric, maldc, and the like. Salts fonned with the free 
caifaoxyl groups may also be derived ftom inorganic bases such as, for example, 
5 sodium, potassram, ammonium, caldum, or ferric hydroxides, and such organic 
bases as iscpropylamine, trimethylamine, 2-ethylamino ethanol, histidine, 
procaine, and the like. 

JLF, Dosa^ and ArfniTmsfra tion of Vaccines 

10 The vaccines are administered in a manner compatible with the 

dosage formulation, and in such amount as will be prophylactically and/or 
therapeutically effective. The quantity to be administCTsd, which is generally in 
the range of 5 A*g to 250 of antigen per dose, depends on the subject to be 
treated, capacity of tiie subject's immune system to synthesize antibodies, and the 

15 degieeof protection desired. Precise amounts of active ingredient required to be 
adnrimstered may dqwnd on flie judgment of the practitioner and may be peculiar 
to each subject 

The vaccine may be given in a single dose schedule, or preferably 
in a multq)le dose schedule. A multqjle dose schedule is one in which a primary 

20 course of vacdnation may be witii 1-10 separate doses, followed by other doses 
given at subsequent time intervals required to maintain and or reenforce the 
nmnune response, for example, at 1-4 months for a second dose, and if needed, a 
subsequent dose(s) after several months. The dosage regimen will also, at least in 
part, be determined by the need of the individual and be dependent upon the 

25 judgment of the practitioner. 

In addition, flie vaccine containing the immunogenic HCV antigen(s) 
may be administered in conjunction with other immunoregulatory agents, for 
example, immune globulins. 

30 ir.G. Preparation of Antibodies Against HCV Epitopes 

The immunogenic polypeptides described herein are used to produce 
antibodies, includmg polyclonal and monoclonal. If polyclonal antibodies are 
desired, a selected mammal (e.^., mouse, rabbit, goat, horse, ^fc.) is immunized 
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with an immunogenic polypq)tide bearing an HCV epitope(s). Serum from the 
immunized animal is collected and Heated according to known procedures. If 
serum containing polyclonal antibodies to an HCV epitope contains antibodies to 
other antigens, the polyclonal antibodies can be purified by inmiunoaffinity 
5 chiomatogr^hy. Techniques for producing and processing polyclonal autism are 
known in the art, see for example, May^ and Walker (I987).Altematively, 
polyclonal antibodies may be isolated from a mammal which has been previously 
infected with HCV. 

Monoclonal antibodies directed against HCV epitopes can also be 

10 readily prcxluced by one skilled in the art. The general methodology for making 
monoclonal antibodies by hybridomas is well known. Immortal antibody- 
producing cell lines can be created by cell fusion, and also by other techniques 
such as direct transformation of B lynq)hocytes with oncogenic DNA, or 
transfection with ^stein-Barr virus. See, M. Schreier cf al, (1980); 

15 Hammeriing et al (1981); Kennett et al. (1980); s^ also, U.S. Patent Nos. 

4,341,761; 4,399,121; 4,427,783; 4,444,887; 4,466,917; 4,472,500; 4,491,632; 
and 4,493,890. Panels of monoclonal antibodies produced against HCV epitopes 
can be screened for various properties; for isotype, q)itope affinity, etc. 

Antibodies, both monoclonal and polyclonal, which are directed 

20 against HCV epitopes are particularly useful in diagnosis, and those which are 
neutralizing are useful in-passive* immunotherapy. Monoclonal antibodies^ in 
particular, may be used to raise anti-idiotype antibodies. 

Anti-idiotype antibodies are immunoglobulins which cany an 
"internal image" of the antigen of the infectious agent against which protection is 

25 desired. See, for example, Nisonoff, A., et al (1981) and Dreesman et al 

(1985). Techniques for raising anti-idiotype antibodies are known in the art. See, 
for example, Grzych (1985), MacNamara et al. (1984), and Vytdehaag et aL 
(1985). These anti-idiotype antibodies may also be useful for treatment, 
vaccination and/or diagnosis of NANBH, as well as for an elucidation of the 

30 immunogenic regions of HCV antigens. 
n.H. Immunoassav and Diagnostic Kits 

Both the polypeptides and the antibodies of the present invention are 
useful in immunoassays to detect presence of HCV antibodies, or the presence of 
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the virus and/or HCV polypqrtides (or epitopes), in, for example, biological 
samples. Design of the immunoassays is subject to a great deal of variation, and 
many formats are known in the art. The immunoassay will utilize at least one 
viral epitope derived from HCV. In one embodiment, the immunoassay uses a 
5 combination of viral epitopes derived from HCV. These epitopes may be derived 
firom the same or from different viral polypeptides, and may be in separate 
recombinant or natural polypeptides, or together in the same recombinant polypq>- 
tides. An immunoassay may use, for exaaq>le, a monoclonal antibody directed 
towards a viral epitope(s), a combination of monoclonal antibodies directed 

10 towards epitopes of one viral antigen, monoclonal antibodies directed towards 
epitopes of different viral antigens, polydonal antibodies directed towards the 
same viral antigen, or polyclonal antibodies directed towards different viral 
antigens. Protocols may be based, for example, upon competition, or direct re- 
action, or sandwich type assays. Protocols may also, for example, use solid sup- 

15 ports, or may be by inununopredpitation. Most assays involve the use of labeled 
antibody or polypeptide; the labels may be, for example, enzymatic, fluorescent, 
chemiluminescent, radioactive, or dye molecules. Assays which amplify the 
signals from the probe are also known; examples of which are assays which utilize 
biotin and avidin, and enzyme-labeled and mediated immunoassays, such as 

20 ELISA assays (described below). 

Typically, an immunoassay for an anti-HCV antibody(s) will involve 
selecting and preparing the test sample suspected of containing the antibodies, such 
as a biological sample, then incubating it with an antigenic (i.e., epitope- 
containing) HCV polypq)tide(s) under conditions that allow antigen-antibody 

25 complexes to form, and then detecting the formation of such complexes. Suitable 
incubation conditions are well known in tiie art. The immunoassay may be, 
without limitations, in a heterogenous or in a homogeneous fonnat, and of a 
standard or competitive type. 

In a heterogeneous fonnat, the polypeptide is typically bound to a 

30 solid support to facilitate separation of the sample from the polypeptide after 

incubation. Examples of solid supports that can be used are nitrocellulose (e.g, , in 
membrane or microtiter well form), polyvinyl chloride (e^g., in sheets or 
microtiter wells), polystyrene latex (e.g., in beads or microtiter plates. 
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polyvinyfidine fluoride (known as Inmnilon*^, diazodzed paper, nylon membranes, 
acdvated beads, and Protein A beads. For example, Dynatech Immulon® 1 or 
Immulon 2 microtiter plates or 0.2S inch polystyrene beads (Precision Plastic 
Ball) can be used in the heterogmeous fbnnat. The solid siq)poit containing the 
5 antigenic polypeptide is typically washed after sq)acating it from the test sample, 
and prior to drtection of bound antibodies. Both standard and competitive fonnats 
aie known in the art. 

In a homogeneous fbnnat, the test sample is incubated with antigen 
in solution. For example, it may be under conditions that will precipitate any 
10 antigen-antibody complexes which are formed. Both standard and competitive 
fonnats for these assays are known in the art. 

In a standard format, the amount of HCV antibodies forming the 
antibody-antigen complex is diiectiy monitored. This may be accomplished by 
d^ormining whether labeled anti-xenogenic (eg. , anti-human) antibodies which 
15 recognize an q>itope on anti-HCV antibodies will bind due to complex fonnation. 
In a comp^ttive fonnat, the amount of HCV antibodies in the sample is deduced 
by monitoiing the competitive effect on the binding of a known amount of labeled 
antibody (or other competing ligand) in the complex. 

Complexes formed comprising anti-HCV antibody (or, in the case of 
20 conq)etitive assays, the amount of competing antibody) are detected by any of a 
number of known technique, dq)ending on the fonnat For example, unlabeled 
HCV antibodies in the complex may be detected using a conjugate of 
antbcenogeneic Ig coniplexed witii a label, {e,g., an enzyme label). 

In immunoassays where HCV polypeptides are the analyte, the test 
25 sample, typically a biological sample, is mcubated with anti-HCV antibodies under 
conditions that allow the fonnation of antigen-antibody complexes. Various 
formats can be employed. For example, a "sandwich assay" may be employed, 
where antibody bound to a solid support is incubated with the test sample; washed; 
incubated with a second, labeled antibody to the analyte, and the support is washed 
30 again. Analyte is detected by determining if the second antibody is bound to the 
support. In a conq)etitive fonnat, which can be either heterogeneous or 
homogeneous, a test sample is usually incubated with antibody and a labeled, 
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competnig antigea is also incafaaied, either sequratially or simultaneously. These 
and other fonnats are well known in the art 

EfBdent detection systems for HCV infection may inchide the use 
of panels of epitopes, as described above. The epitopes in the panel may be 
5 constructed into one or multq)le polypeptides. The assay s for the varying q)itopes 

may be sequential or simultaneous. 

The enzyme-linked immunosorbent assay (ELBA) can be used to 
measure either antigen or antibody concentrations. This method dq)ends upon 
coigugation of an enzyme to eidier an antigen or an antibody, and uses the bound 

10 enzyme activity as a quantitative label. To measure antibody, the known antigen 
is fixed to a solid phase (c.^., a microplate or plastic cup), incubated with test 
sennn dihitions, washed, incubated with anti-inmranoglobulin labeled with an 
enzyme, and washed again. Enzymes suitable for labeling are known in the art, 
and inctode, for example, horseradish peroxidase. Bizyme activity bound to the 

15 solid phase is measured by addmg the specific substrate, and determining product 
formation or substrate utilization colorimetrically. The enzyme activity bound is a 
direct function of the amount of antibody bound. 

To measure antigen, a known specific antibody is fixed to the solid 
phase, the test material containing antigen is added, after an incubation the solid 

20 phase is washed, and a second enzyme-labeled antibody is added. After washing, 
substrate is added, and enzyme activity is estimated colorimetrically, and related to 
antigen concentration. Kits suitable for immunodiagnosis and containing the 
qjpropriate labeled reagents are constructed by packaging the ^rppriate 
Tnff tf>riflls^ including the polypeptides of the invention containing HCV q)itopes or 

25 antibodies directed against HCV epitopes in suitable containers, along with the 
remaining reagents and materials required for the conduct of the assay, as well as 
a suitable set of assay instructions. 



HI. General Methods 

30 The general techniques used in the practice of the present invention 

can be found in, for example, the references cited herein, particularly EPO Pub. 
Nos. 318,216 and 388,232, as well as the references in the bibUography, which 
arc incorporated herein by reference. 
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IV. TtTamplft^ 

Described below are examples of the present inventi n which ate 
provided only for illustrative purposes, and not to limit the scope of the present 
invCTtion. In light of the present disclosure, num^ous embodiments within the 
5 scope of the claims will be zppaimt to those of ordinary skill in the ait. 

IV. A. Epitope Mapping of HCV Genome 

The following example is the result of an epitope mapping 
expGdmwt conducted on the HCVl polyprotem sequence shown in Fig. L As 

10 shown m Fig. 3-11, there are h^rograeities among HCV isolates, indicating that 
these amino add substitutions can be made in the octamers described below. In 
addition to substitutions with amino adds ftom the corresponding location in other 
HCV isolates, substitutions with synthetic analogs of the particular amino acids or 
conservative substitutions based on charge, etc. ^particularly when the substitution 

IS does not destroy antibody binding) are within the scope of the invention. 

IV.AJ. Synthesis of Overlappinp Peptides 

Polyethylene pins arranged on a block in an 8x12 array (Coselco 
Mimetqpes, ^^ctoria, Australia) were prepared by pladng the pins in a bath (20% 

20 v/v piperidine in dimethylformamide (DMF)) for 30 minutes at room temperature. 
Hie pins were then removed, washed in DMF for S min, then washed in methanol 
four times (2 min/wash). Hie pins were allowed to air dry for at least 10 min, 
then washed a final time in DMF (S min). l-Hydroxybenzotriazde (HOBt, 367 
mg) was dissolved in DMF (80 mL) for use in cotq>ling Fmoc-protected amino 

25 acids: Fmoc-L-Ak-OP^, Fmoc-L-Cys(Trt)-OPI^, Fmoc-L-Asp(0-lBu)-OI^ 

Fmoc-L-Ghi(0-/Bu)-OPQ[), Fmoc-L-Phe-OP^, Fmoc-Gly-OP^, Fmoc-L-His(Boc)- 
OP^, Fmoc-L-ne-OPQ), Fmoc-L-Lys(Boc)-OP^, Fmoc-L-Leu-OPI^, Fmoc-L-Met- 
OP^, Fmoc-L-Asn-OPQ?, Fmoc-L-Pro-OP^, Fmoc-L-Gln-OP^, Fmoc-L- 
Arg(Mtr)-OPfy, Fmoc-L-Ser(/-Bu)-ODhbt, Fmoc-L-Thr(r-Bu)-ODhbt, Fmoc-L-Val- 

30 OP^, and Fmoc-L-iyr-OP^. 

The protected amino acids were placed in microther plate wells with 
HOBt, and the pin hlr placed over the plate, immersing the pins in the wells. 
The assembly was then sealed in a plastic bag and allowed to react at 2S**C for 18 
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hours to couple the fiist amino acids to tfie pins. The block was dien removed, 
and the pas washed with DMF (2 min), MeOH (4X2 min), and again with 
DMF (2 min) to clean and dq)rotect the bound amino adds. The procedure was 
repeated for each additional amino acid co\q)led, until all octamers had been pre- 
5 pared. 

The ftee N-termini were then acetylated to compensate for the free 
amide, as most of the qritopes are not found at the N-termimis and thus would not 
have the associated positive chaige. Acetylation was acconqilished by filling the 
wells of a microtiter plate with DMF/acetic anlydride/triethylamine (5:2:1 v/v/v) 
10 and allowing the pins to react in the wells for 90 min at 20"'C. The pins were 

then washed with DMF (2 min) and MeOH (4 x 2 min). and air dried for at least 

10 min. 

Hie side chain protecting groups were lonoved by treating the pins 
with trifluoroacetic add/phenol/dithioethane (95:2.5:2.5, v/v/v) in polypropylene 
15 bags for 4 hours at loom temperature. The pins were then washed in dichloro- 
methane (2x2 min), 5% di-isopropylethylamine/dichloromethane (2x5 min), 
dichloiometfaane (5 min), and air-dried for at least 10 min. The pins were then 
washed m water min), MeOH (18 hours), dried in vacuo, and stored in sealed 
plastic bags over silica geL 

20 

IV.A.2. Assay of Peptides 

Octamer-bearing pins prepared as above were first treated by son- 
icating for 30 min in a disruption buffer (1 % sodium dodecylsulfate, 0.1 % 2-mer- 
c^toetijanol, 0.1 M NaHjPO*) at 60°C. The pins were then immersed several 

25 times in water (jSO^Q, followed by boiling MeOH (2 min), and allowed to air diy. 
The pins wrae then precoated for 1 hour at 25'C in miootitar wells containing 
200 /iL blocking buffer (1 % ovalbumin, 1 % BS A, 0. 1 % Tween®, and 0.05 % 
NaNj in PBS), with agitation. The pins were then immersed in microtitBr wells 
containing 175 fiL antisera obtained from human patients diagnosed as having 

30 HCV and allowed to incubate at 4"'C ovenaghL The pins wrae assayed against 
antisera from three individual patients. Specimen ffPAA 3663-s ("A") exhibited 
strong reaction to HCV Western blots, HCV competitive ELBA, HCV ELISA to 
clone ClOO-3 (at 1:1000 dilution), and KIBA responses of >4+ to ClOO, 5-1-1, 



wu y;*/003fe5 ,33. PCr/US92/0S388 

and C33c (C22 not done). (Andgen/done names are per EPO Pub. Nos. 318,216 
and 388,232, as well as tiiose described in the literatine regarding HCV 
immxmoassays available from Ortho Diagnostics Systems, Inc.) Neat plasma was 
diluted 1:500 in blocking buffer. Specimen ffPAA 33028 ("B") exhibited strong 
5 reacti n to HCV Western bl ts, HCV competitive ELISA, HCV EUSA to clone 
ClOO-3 (at 1:500 dihition), and RIBA responses of >4+ to ClOO, 5-1-1, C33C 
and C22. Polyclonal antisera was partially purified by passage through a protein 
A column, and was used at a dilution of 1:200 in blocking buffer. Specimen 
#PAA s32931 ("C") exhibited moderate reaction to HCV Western blots (3+), 

10 HCV competitive ELBA, HCV ELBA to clone ClOO-3 (at 1:64 dilution), and 
RIBA responses of 3+ and 4+ to ClOO and 5-1-1, respectively (C33c and C22 
not done). Polyclonal antisera was partially purified by passage through a protein 
A column, and was used at a dilution of 1:500 in bloddng buffer. 

The pins were washed in PBS/Tween* 20 (4 x 10 min) at room 

15 temperature, then incubated in miciotiter wells containing horseradish peroxidase- 
labeled goat anti-Human Ig antisera (175 /iL, 1:2000 dilution m blockmg buffer 
without NaNa) for 1 hour at 25''C with agitation. The antihuman antisera is spe- 
cific for human Ig light and heavy chains, and reacts with both IgG and IgM 
classes. The pins were again washed in PBS/Tween*** 20 (4 X 10 min) at room 

20 temperature. Substrate solution was prepared by diluting NaH2P04 (1 M, 200 mL) 
and citric acid (1 M, 160 mL) to 2 L with distilled water, adjusting the pH to 4.0. 
Azino-di-3-ethylbenzthiazodinsulfonate (ABTS, 50 mg) and hydrogen peroxide 
(0.3 /iL/mL) was added to 100 mL of buffer immediately prior to use to conq}lete 
the substrate solution. The substrate solution (150 /iL) was added to each well of 

25 a microtit^ plate, and the pins inunersed in the wells and incubated at 25°C in the 
dark. After color developed, the reactions were halted by removing the pms, and 
absorbance of the solutions read at 40S nm. 

The octamers listed below were immunoreactive with anti-HCV 
antisera. Pq)tides reacting with all three antisera are listed as q)itopes, while pep- 

30 tides reacting with only one or two antisera are listed as weak q)itopes (indicated 
by " - "). Particularly strong epitopes are labeled with letters ratiier than numbers 
(e.g., J^AA). 
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15 



20 



25 
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35 



40 
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AA# Sequence 


AA# Sequence 




23 


KEPGGGQA 




87 


GNEGCGWA 


24 


EPGGGQAV 


Ep2 




NEGCGWAG 


25 


PGGGQAVG 




88 


26 


GGGQAVGG 




89 


EGCGWAGW 


27 


GGQAVGGV 




90 


GCGWAGWL 


28 


GQAVGGVY 




91 


CGWAGWLL 


29 


QAVGGVYL - 




92 


GWAGWLLS 


30 


AVGGVYLL 




93 


WAGWLLSP 


31 


VGGVYLLP 




Oil 


AGWLLSPR 


32 


GGVYLLPR 




95 


GWLLSPRG 


33 


GVYLLPRR 




96 


WLLSPRGS 


34 


VYIXPRRG 




97 


LLSPRGSR 


35 


YLLPRRGP 




98 


LSPRGSRP 


36 


LLPRRGPR 




99 


SPRGSRPS 


66 


PKARRPEGl^l 




100 


PRGSRPSW Ep3 


67 


KASBPEGR 




101 


RGSRPSWG 


68 


ASRPEGRT 




102 


GSRPSWGP 


69 


RRPEGRTW 




103 


SRPSWGPT 


70 


RPEOITWA 






TVPASAYQ ^ 


71 


PEGRTWAQ 




186 


72 


EGRTWAQP 




187 


VPASAYQv 


73 


CaiTWAQPG ^A 




188 


PASAYQVR 


74 


RTWAQPGY 




189 


ASAYQVRN 


75 


TWAQPGYP 




190 


SAYQVRNS 


76 


WAQPGYPW 




191 


AYQVENST 


77 


AQPGYPWP 






DCTNSSIV — 


78 


QPGYPWPL 




206 


79 


PGYPWPLG 








80 


GYPWPLYG 




223 


TPGCVPCV - 


81 


YPWPLYGN 






EGNASRCW Ed5 


82 


PWPLYGNE 




232 


83 


WPLYGNEG 








84 


PLYGNEGC 




256 


TQLRRHID — 


85 


LYGNEGCG 








86 


YGNEGCGW 




286 


LVGOLFTF — 








297 


RHWTTOGC Ed6 








298 


HWTTQGCN 








299 


WTTQGCNC 








321 


MMMNWSPI ~ 








347 


DMIAGAHW Bgl 



LAGIAYFS %8 
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25 



30 
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40 



413 IJNTNGSW 1^ 

414 INTNGSWH 

432 SLNTGWLA - 

465 FDQGWGHi^C 

466 DQGWGFIS 

467 QGWGPISY 

468 GWGPISYA 

469 WGPISYAN 

470 GPISYANG 

471 HSYANGS 

480 PDQRPYCW IBpD 

481 DQRPYCWH 

482 QRPYCWHY 

483 RPYCWHYP 

484 PYCWHYPP 

500 KSVCGPVY iEp9 

501 SVCGPVYC 

502 VCGPVYCE 

521 RSGAPTYS ^10 

540 NNTRPPLGEpE 

541 NTRPELGN 

542 TRPPLGNW 

543 RPPLCaWF 

544 PPLGNWFG 

545 PLGNWFGC 

546 LGNWFGCT 

547 GNWFGCTW 

548 NWFGCTWM 

549 WFGCTWMN 

579 IHCPTDCF - 



^12 



594 YSRCGSGP 

595 SRCGSGPW 

596 RCGSGPWL 

597 CGSGPWLT 

598 GSGPWLTP 

599 SGPWLTPR 

600 GPWLTPRC 

601 PWLTPRCL 

602 WLTPRCLV 

603 LTPRCXVD Bp¥ 

604 TPRCLVDY 

605 PRCLVDYP 

606 RCXVDYPY 

607 CLVDYPYR 

608 LVDYPYRL 

609 VDYPYRLW 

610 DYPYRLWH 

611 YPYRLWHY Fg 

612 PYRLWHYP 

613 YRLWHYPC 

641 EAACNWTR 



662 LSFLLLn ^13 

663 SPTTinr 

664 PLLUnQ 

665 LtUnQW 

685 LSTGLIHL - 



45 



705 
706 

729 

782 

783 
784 
785 
786 
787 
788 
789 



VGSSIASW - 
GSSIASWA 

ARVCSCIW Epl4 

WVPGAVYT 

VPGAVYTF 

PGAVYTFY 

GAVYTFYG 

AVYTFYGM 

VYTFYGMW 

YTFYGMWP 

TFYGMWPL 



801 LALPQRAY ~ 
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851 


RVEAQLBV]^ 




1384 


VIKGGRHL ^20 


852 


VEAQLHVW 






LGINAVAY ^21 


853 


EAQLHVWI 




1410 


854 


AQLHVWIP 




1411 


G[NAVAYY 


855 


QLBVWIPP 












1454 


CNTCVIQT - 


893 


LAVFGPLW - 














1492 


GRGKPGIY ^22 


916 


QGLLRFCA - 




1493 


RGEPC3YR 


928 


MIGGHYVQ ^15 




1532 


PAEITVRLEp23 






1533 


AETTVRLR 


946 


TGTYVYNHEpI 




1534 


BITVRLRA, 






1535 


TTVELRAY 


952 


NHLTPLRD EpJ 








953 


HLTPLRDW 




1560 


GVHGLm ]^4 


954 


LTPLRDWA 




1561 


VFIGUHI 


1026 


LAPTTAYA - 




1581 


ENLPYLVA 






^25 






1072 


TONGVCW^K 




1567 


NLPYLVAY 






1568 


LPYLVAYQ 


1109 


LVGWPAPQ ^16 




1569 


FYLVAYQA 






1570 


YLVAYQAT 


1171 


CPAGHAVG Bpn 




1571 


LVAYQATV 


1113 


PAGBAVQI 




1572 


VAYQATVC 


1114 


AGHAVGIF 




1573 


AYQATVCA 


1115 


GHAVQIFR 




1574 


YQATVCAR 


1116 


HAVGEFEA 




1575 


QATVCARQ 


1117 


AVGDFRAA 




1576 


ATVCARQA 








1577 


TVCARQAP 


1218 


WPQSEQV 












1601 


PPSWDQMW 


1240 


VPAAYAAQ - 


1^6 












1602 


PSWDQMWK 


1260 


ATEXiFGAY - 




1603 


SWDQMWKC 








1604 


WDQMWKCX 


1280 


GVBnTGS ^18 




1605 


DQMWKCII 


1281 


VBTTIGSP 




1606 


QMWKCUR 


1282 


EriTGSPI 




1607 


MWKCURL 


1283 


nTGSPrr 








1284 


TTGSPrrY 




1615 


KPTLHGH]^27 


1285 


TGSPITYG 




1616 


PTLHGPIP 








1617 


TLHGPIPL 


1322 


DATSILGI - 




1618 


LSGPIPIX 








1619 


HGPIPLLY 


1338 


TAGARLW ~ 




1620 


GPIPLLYR 



1371 GEIPFYGK Epl9 



1655 WTSTWVL - 
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1694 


UPDREVLBsM 




1966 


SECTIPCS 


1695 


IPDREVLY 




1967 


ECTIPCSG 








1968 


CnPCSGS 


1710 


ECSQHLFY 




1969 


UPCSGSW 


1711 


CSQHLPYI 








1712 


SQHLPYIE 




1999 


LMPQLPGI 








2000 


MPQLPGEP 


1728 


EKQKALGL BpO 




2001 


PQLPGIPE 


1729 


KQKALGLL 




2002 


QLPGIPEV 








2003 


LPGIPHVS 


1758 


EmWAELM^P 




2004 


PGIPEVSC 


1759 


lEWAKLMW 




2005 


GIPEVSCQ 


1760 


EWAEIMWN 




2006 


IPEVSCQR 


1761 


WAKLMWNE 




2007 


PBVSCQRG 


1762 


AELMWNEI 




2008 


EVSCQRGY 








2009 


VSCQRGYK 


1781 


LPGNPAIA ~ 




2010 


SCQRGYKG 








2011 


CQRGYKGV 


1808 


LFNILGGW ^28 




2012 


QRGYKGVW 








2013 


RGYKGVWR 


1821 


AAPGAATA ~ 




2014 


GYKGVWRG 


1851 


ILAGYGAG ^29 




2024 


XMHTRCHC 






B>31 






1880 


VNLLPAIL ~ 














2048 


VGPRICRN EpU 


1908 


PGEGAVQW ^G 




2049 


GPRICRNY 


1909 


GEGAVQWM 




2050 


PRICRNYW 


1910 


EGAVQWMN 




2051 


RICRNYWS 


1911 


GAVQWMNR 




2052 


ICRNYWSG 


1912 


AVQWMNRL 




2053 


CRNYWSGT 


1913 


VQWMNRU 




2054 


RNYWSGTE 








2055 


NYWSGTEP 


1925 


RGNHVSPI IBpR 




2056 


YWSGTEPI 








2057 


WSGTEPIN 


1940 


AAARVTAIEpSO 








1941 


AARVTAIL 




2071 


TPLPAPNY Ep32 


1942 


ARVTAILS 






1943 


RVTAILSS 




2088 


EEYVIRQV EpV 


1944 


VTAILSSL 




2089 


EYVIRQVG 


1945 


TAILSSLV 




2090 


YVmQVGD 


1946 


AILSSLVT 




2091 


VIRQVGDF 


1947 


ILSSLVTQ 




2092 


IRQVGDFH 


1948 


LSSLVTQL 




2093 


RQVGDFHY 


1949 


SSLVTQLL 








1950 


SLVTQLLR 




2108 


DNLKCPCQ ~ 


1951 


LVTQLLRR 
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2122 


EIELDGVR^W 




22oU 


TS'C A A T "DTI/ T7«7 


2123 


DELDGVRL 




2281 


"OA OAT HVW 


2124 


ELDGVRLH 




22o2 




2125 


LDGVRLHR 








2126 


DGVRLHRF 






AT PVWAl^P 


2127 


GVRUaRFA 




//O 7 




2128 


VRLHRFAP 




ZZoO 


pvw A ppnv 


2129 


KLHKFAPF 








2130 


LHRFAPPC 




ZZoo 


W A PPrj VMP 


2131 


HRFAPPCK 






A PPnVMPP 


2132 


RFAPPCKP 






PPnVMPPT 


2133 










2134 


APPCKPLL 






iririrJKAkllivX l^J^ 


2135 


PPCKPLLR 






JfirisJSjSJS.x V 


2136 


PCKPIJLRE 






PPTTTTPTW 
JrivEkXkivx V V 


2137 


f TKPl T,RHK 








2138 


KPLLREEV 






AT7T A^PCP Pn'^^ 


2139 


PTT-RKHVS 






PT A^PWG 


2140 


LLREEVSF J^X 








2141 


LREEVSFR 








2142 


REEVSFRV ^ 








2143 


EEVSFRVCj 








2144 








AESYSSMP Ed37 


2145 


VSFRVGLH 








2146 


SERVGLHE 






OXJVJO W d X V 


2147 


FRVGLBEx 








2148 


RVGLBEYP 














241 / 


V V l^L^iVlo 1 


2165 


EPEPDVAV - 
















VPP ^A/T^ YW 


2187 


GRELLAKuS — 






L.* wOiVld X W X 








2420 


CSMSYWIG 








2421 


SMSYWIGA 








2422 


MSYWIGAL 












2229 


ANLLWRQE 








2230 


NLLWRQEM 










T T \irDf\n\/(fz 








2232 










2244 


VfioxlNJvVV EpJJ 










i3oJciNA.V VI 








2/4o 


ailiNJvV VJLL» 








2247 


"CXTl*^ V7X7TT 








2248 


NKWILDS 








2249 


KWUDSF 








2250 


WIIJ3SFD 









2267 



EISVPAEI^34 
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2439 


QKLHNAL ^>BB 




2602 


UPLAVMGS 


2440 


KLPINALS 


EpDD 






2441 


LPINALSN 




2603 


PLAVMGSS 


2442 


FINALSNS 




2604 


LAVMGSSY 


2443 


INALSNSL 




2605 


AVMGSSYG 


2444 


NALSNSUL 




2606 


VMGSSYGE 


2445 


ALSNSLLR 




2607 


MGSSYGEQ 


2446 


LSNSLLRH 




2608 


GSSYGEQR 


2447 


SNST.T.KHH 




2609 


SSYGEQRV 


2448 


NSLLRHHN 




2610 


SYGEQRVE 


2449 


SLLRHHNL 




2611 


YGEQRVEE 


2450 


LLRHHNLV 




2612 


GEQRVEEL 


2451 


LRHHNLVY 








2452 


RHHNLVYS 




2632 


KTPMGFSY 


2453 


HHNLVYST 


Ep41 






2454 


HNLVYSTI 


2633 


TTMGFSYD 


2455 


NLVYSTTS 




2634 


PMGFSYDT 


2456 


LVYSTBR 




2635 


MGFSYDTR 








2636 


GFSYDTRC 


2469 


QKKVTFDR Ep39 




2637 


FSYDTRCE 


2470 


EKVTFDKL 




2638 


SYDTRCED 


2471 


KVTFDRLQ 








2472 


VITDRLQV 




2660 


YQCCDLDP - 


2473 


TEDRLQVL 








2474 


FDRLQVLD 




2676 


LTERLYVG 


2475 


DRLQVLDS 


^EE 






2476 


pTjOVLDSH 




2677 










2678 


ERLYVGGP 


2495 


ASKVKANL ~ 




2679 


RLYVGGPL 


2533 


RKAVraiN^CC 




2688 


NSRGENCG 


2534 


KAVTBINS 


^FF 












2689 


SRGENCGY 


2573 


GRKPASU^>40 




2690 


RGENCGYR 


2574 


RKPARUV 




2691 


GENCGYRR 


2575 


KPARUVF 




2692 


ENCGYRRC 


2576 


PARLTVFP 




2693 


1^ W X RJR\^J\. 


2577 


ARUVFPD 








2578 


RUVFPDL 




2707 


TSCGNTLI Ed42 








2721 


AACRAAGL ~ 








2757 


AFTEAMTR 






^3 












2758 


FTEAMTRY 








2759 


TEAMTRYS 








2760 


EAMTRYSA 








2761 


AMTRYSAP 








2162 


MTRYSAPP 
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2779 


PI_.HI JISL^ h^>44 


9878 
zo / 0 


r^T pphOR Ed47 


2780 




9870 
Zo #7 








2R80 


PPHQRLH 


275/4 


£lJJVXn.\jrAJK. V 


2881 


FDQSIHG 






2882 


nQKLHGL 




Tirs A /Tin? w 


2883 


lOKLHGLS 


z/yo 


urAvjJSuvV 1 1 


2884 


QKLHGLSA 


z/y / 


ArSITPVWT 


2885 


SLHGLSAF 




(jJSJvV I xJUl 


2886 


IHGLSAFS 




J3LKV I Xi^liv 


2887 


HGLSAFSL 


2800 


RVYYLrTKD 


zooo 


GLSAFSLH 


zoUl 


V X iLrlKUJr 


2889 


LSAFSLHS 


2802 


YYLTRDPT 


9800 








2891 


AFSLHSYS 


Zol / 




2892 


FSLBSYSP 






2893 


SLBSYSPG 


Zoio 




2894 


LHSYSPGE 




Xxvl\A X V 


2895 


HSYSPGEI 


ZoZU 


/UVfX XXr V 1^ 0 








JEVU.X& V X^O VV 








XXXJr vxNO WX^ 








XXr V 1^0 VtX^w 








XT V XNO ™ L^Jlri 






9R9^ 








ZoZO 


XND W XAJiNlX 






XOZ/ 


0 Yr JLAJX^XXiVX 






ZoZo 


W XAJriYXLLVXC* 






9890 
Z0Z7 


XAXi^ XLLVi r\r\. 






98^0 








98^1 


MTTMT5APT 






98^9 


XXiVX CaMT A.JL./ 






98^^ 








98^A 


AyTPAPTT WA 






98^^ 


T! APT! WAP 






98'^f^ 
ZOJO 


APTT WAPM 






/ 


PTTWAPMT 






98^8 








98^0 


T WAPMIIM 






9840 


WAPMTLMT 






2841 


ARMELMIH 






2842 


RMnMIHF 






2843 


MILMTHFE 







2863 LDCEIYGA ^)46 

45 2864 DCETYGAC 

2865 CEIYGACY 

2866 ETYGACYS 

2867 lYGACYSI 
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2912 


LGVPPLRA Bp^i 


2913 


GVPPLRAW 


2914 


VPPLRAWR 


2938 


AAICGKYL£p49 


2939 


AICGKYIE 


2940 


ICGKYLEN 


2966 


DLSGWETA 



10 ^iHH 

2984 VSHARPRW EpH 

15 IV.B. Diiferential Assay 

Hie folIowiBg assay was peifcmned to distinguish early antigens 
from later antigens. Antibodies to the eaily antigens may be detected, and used to 
diagnose HCV infection more quickly. 

Serial bleeds were obtained from a human patient presenting with 

20 elevated ALT, but negative for anti-ClOO-3 antibody. The five bleeds obtained 
prior to complete seroconversion (ClOO-3 positive) were pooled and used in the 
assay at a dilution of 1:2000. The assay was conducted as described in Section 
IV.A. above. However, one duplicate set of pins was incubated with horseradish 
peroxidase-labeled goat anti-Huoian IgG specific antisera^ while the other set was 

25 incubated with horseradish peroxidase-labeled goat anti-Human IgM specific anti- 
sera, ^itopes immunoreactive with IgM antibodies are eariy epitopes. 

The results indicated that most early epitopes are found in the 
region extending from about amino acid 480 to about amino acid 650. Particularly 
strong IgM epitopes were octamers beginning with amino add nos. 506, 510, 523, 

30 553, 562, 580, and the region from 590 to 620. Assays which employ antigens 
bearing q)itppes from this region will permit diagnosis of HCV infection at an 
early point than assays employing other antigens. 

We have additionally tested serial plasma specimens taken from five 
patients with open heart post-transfusion NANB hepatitis, with studies followed 

35 for 3-12 years. Initial bleed dates were less than one week apart. Each specimen 
was tested for IgG and IgM by EIA against one core antigen (C22), two envelope 
antigens (El and E2), and three nonstructural region antigens (C33c, ClOO, and 
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NS5). We found that the lesponse to C22 and C33c preceeded the IgG 
response for those antigens. NS-5 also induced an IgM response, but this 
response did not preceed the IgG response for that antigen. Thus, one can prepare 
assays capable of determining very early stages of infection by utilizmg qjitqpes 
5 derived ftom the C22 and C33c regions and assaying for IgM binding. Antibodies 
to the C33c region persisted for the longest period of time, suggesting that 
diagnostic assays directed toward C33c should be the most rdiable. 

IV.C. Sequence V ariations in HCV Isolates ftom Different Individuals 
10 Isolates of HCV which contain sequences which deviate ftom 

CDC/HCVl were identified in human individuals, some of whom were 
serologically positive for anti-ClOO-3 antibodies (EClO was antibody negative). 
Identification of these new isolates was accomplished by cloning and sequencing 
segments of the HCV genome which had been amplified by the PCR technique 
15 using CDC/HCl ^quences. The method utilizes primers and probes based upon 
the HCV cDNA sequences described herein. The first step in the metiiod is the 
synthesis of a cDNA to either the HCV genome, or its replicative intermediate, 
usmg reverse transcriptase. After synthesis of the HCV cDNA, and prior to 
amplification, the RNA in die sample is degraded by techniques known in tfie art. 
20 A designated segment of the HCV cDNA is then amplified by the use of the 

^rprqpriate primers. The amplified sequences are cloned, and clones containing 
the amplified sequences are detected by a probe which is conq)lemCTtary to a 
sequence lying between the primers, but which does not overly the primers. 

25 IV.C.1. HCV Isolates Isolated from Humans in tiie U.S. 

Blood sainples which were used as a source of HCV virions were 
obtained ftom the American Red Cross in Charlotte, North Carolina, and ftom the 
Community Blood Center of Kansas, Kansas City, Missouri. The samples were 
screened for antibodies to the HCV ClOO-3 antigen using an EUSA assay and 

30 subjected to supplemental Western blot analysis using a polyclonal goat anti-himian 
HRP to measure anti-HCV antibodies. Two samples, #23 and #27, ftom the 
American Red Cross and ftom the Community Blood Center of Kansas, 
respectively, were determined to be HCV positive by these assays. 



Vixal paxticles piesrat in the serum of these samples were isolated 
by ultrac^itrifugation under the conditions described by Bradley et al (19SS). 
RNA was extracted from the particles by digestion with proteinase K and SDS at 
final concentrations of 10 fig/mL proteinase K, and 0.1 % SDS; digestion was for 
5 1 hour at 37**C, Viral RNA was further purified by extraction with chlorofonn- 
phenol. 

HCV RNA in the preparation of RNA was reverse transciibed into 
cDNA. After both strands of the cDNA were synthesized, the resulting cDNA 
was then amplified by the VCSL method. The HCV cDNAs in three clones derived 
10 from each HCV isolate, were subjected to sequence analysis. Analysis was 
essentially by the method described in Chen and Seebuig (1985). 

Consensus sequences of the clones derived from HCV in samples 23 
and 27 are shown in Fig. 3 and Fig. 4, respectively. The variable sequences are 
also shown in these figures, as are the amino acids encoded in the consensus 
15 sequences. 

Figures 5 and 6 show comparisons of the aligned positive strand 
nucleotide sequences (Fig. 5) and putative amino add ^quences (Fig. 6) of 
samples 23, 27, and HCVl. The amino acid sequence of HCVl in Fig. 6 
represents amino add numbers 129-467 of the HCV polyprotein encoded by the 

20 large ORF in the HCV genomic RNA. An examination of Figs. 5 and 6 show that 
there are variations in the sequences of the three isolated clones. Tlie sequence 
variations at the nucleotide level and the amino add level are summarized in the 
table immediately below. In the table, the polypeptides designated S and NSl 
represent amino acid numbers 130 to —380, and 380 to -470, respectively, as 

25 those domains were previously known. The numbering is from the putative 

initiator metiiionine. The tenninology S and NSl is based upon the positioning of 
the sequences encoding the polypeptides using the Flavivirus model. As discussed 
above, however, recent evidence suggests that there is not total comdation 
between HCV and the Flaviviruses with regard to viral polypeptide domains, 

30 paiticulariy in the putative E/NSl domains. Indeed, HCV polypeptides and their 
codmg domains may exhibit substantial deviation from the Flavivirus model. 
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Viial particles piesrat in the serum of these samples were isolated 
by ultraceotrifugation under the conditions described by Bradley et al. (1985). 
RNA was extracted from the particles by digestion with proteinase K and SDS at 
final concentrations of 10 /ig/mL proteinase K, and 0.1 % SDS; digestion was for 
5 1 hour at 37**C. Viral RNA was further purified by extraction with chloroform- 
phenol. 

HCV RNA in the prqjaration of RNA was reverse transcribed into 
cDNA. After both strands of the cDNA were synthesized, the raiting cDNA 
was then amplified by the PGR method. The HCV cDNAs in three clones derived 
10 from each HCV isolate, were subjected to sequence analysis. Analysis was 
essentially by the method described in Chen and Seebuig (1985). 

Consensus sequences of the clones derived from HCV in samples 23 
and 27 are shown in Fig. 3 and Fig. 4, respectively. TTie variable sequences are 
also shown in these figures, as are the amino acids encoded in the consensus 
15 sequences. 

Figures 5 and 6 show comparisons of the aligned positive strand 
nucleotide sequences (Fig. 5) and putative amino acid sequences (Fig. 6) of 
samples 23, 27, and HCVl. The amino acid sequence of HCVl in Fig. 6 
represrats amino add numbers 129-467 of the HCV polyprotein encoded by the 

20 large ORF in the HCV genomic RNA. An examination of Figs. 5 and 6 show that 
there are variations in the sequences of the three isolated clones. The sequence 
variations at the nucleotide level and the amino add level are summarized in the 
table immediately below. In the table, the polypeptides designated S and NSl 
represent annuo acid numbers 130 to -380, and 380 to -470, respectively, as 

25 those domains were previously known. The numbering is from the putative 

initiator methionine. The terminology S and NSl is based upon the positioning of 
the sequCTces encoding the polypeptides using the Flavivirus model. As discussed 
above, however, recent evidence suggests that there is not total correlation 
betwera HCV and the Flaviviruses with regard to viral polypq)tide domains, 

30 particulariy in the putative E/NSl domains. Indeed, HCV polypeptides and their 
coding domains may exhibit substantial deviation from the Flavivirus model. 
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TABLE: 

Sequence Homology 
Nudeotide EncodingAmiao Add Bacoded 



overall 


S 


NSl 


overall 


S 


NSl 


% 


% 


% 


% 


% 


% 


HCV1/HCV23 93 


95 


91 


92 


95 


87 


HCV1/HCV27 89 


93 


84 


89 


95 


82 


HCV23/HCV27 89 


93 


85 


90 


93 


84 



10 Although there are variations in the newly isolated HCV sequences, 

the cloned sequences from samples 23 and 27 (called HCV23 and HCV27) each 
contain 1019 nucleotides, indicating a lack of deletion and addition mutants in tfiis 
region in the selected clones. The sequences in Rgs. 5 and 6 also show that the 
isolated sequences aie not leananged in this region. 

15 A comparison of the consensus sequences fiw HCVl and for the 

other isolates of HCV is summarized in die Table, supra. The sequence variations 
between the chimpanzee isolate HCVl, and the HCVs isolated from humans are 
about the same as that seen between the HCVs of human origin. 

It is of interest that the sequence variations in two of the putative 

20 domains is not uniform. The sequence in a putative S region ^)pears to be 

relatively constant, and randomly scattered throughout the region. In contrast, a 
putative NSl region has a higher degree of variability than the overall sequence, 
and the variation appoais to be in a hypervariable pocket of about 28 amino acids 
which is located about 70 amino acids downstream from the putative N-terminus 

25 of the putative polyprotein. 

Although it may be argued that the detected variations were 
introduced during the amplification process, it is unlikely that all of the variations 
are from this result. It has been estimated tiiat Taq polymerase introduces errors 
into a sequence at approximately one base per 10 kilobases of DNA template per 

30 cyde (Saiki et al (1988)). Based upon tiiis estimate, up to 7 errors may have 
been introduced during the PGR amplification of the 1019 bp DNA fragment. 
However, die tiuee subclones of HCV-23 and HCV-27 yielded 29 and 14 base 
variations, respectively. The following suggest that tiiese variations are naturally 
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occurring. About 60% of the base changes are silent mutations which do not 
change the amino add sequence. Variations introduced by the Taq polymerase 
during PCR amplification would be eoqpected to occur randomly; however, the 
results show that the variant sequences are clustered in at least one specific region. 

5 

IV.C.2. HCV Isolates from Humans in Italy and in the U.S. 

Segments of HCV RNA present in different isolates were amplified 
by the HCV/cPCR method. These segments span a region of -0.6 Kb to - 1.6 
Kb downstream from the methionine encoding start codon of the putative HCV 

10 polyprotein. Hie isolates are from biological specimras obtained from HCV 
infected individuals. More specifically, isolate HCT #18 is from human plasma 
from an individual in the U.S.A., ECl and EClO are from a liver biopsy of an 
Kalian patient, and Hi is from a peripheral blood mononucleocyte fraction of an 
American pattern. Conq>arable segments of HCV SNA have bem isolated from a 

IS chimpanzee. 

SNA was extracted from the human plasma specimens using 
phenolrCHClarisoamyl alcohol extraction. Either 0.1 mL or 0.01 mL of plasma 
was diluted to a final volume of 1.0 mL, with a TENB/proteinase K/SDS solution 
(0.05 M Tris-HCL, pH 8.0, 0.001 M EDTA, 0.1 M NaCl, 1 mg/mL Proteinase 

20 K, and 0.5% SDS) containing 10 to 40 ftg/mL polyadenylic acid, and incubated at 
37°C for 60 minutes. After this proteinase K digestion, the resultant plasma 
fractions were deproteinized by extraction with TE (50 mM Tris-HCl, pH 8.0, 1 
mM EDTA) saturated phenol, pH 6.5. The phenol phase was separated by 
cratrifiigation, and was reextracted with TENB containing 0.1 % SDS, The 

25 resulting aqueous phases from each extraction were pooled, and extracted twice 
with an equal volume of phenol/chloroform/isoamyl alcohol [1:1(99:1)], and then 
twice with an equal volume of a 99:1 mixture of chlorofonn/isoamyl alcohol. 
Following phase separation by c^itrifugation, the aqueous phase was brought to a 
final concentration of 0.2 M Na acetate, and the nucleic acids were precipitated by 

30 the addition of two volumes of ethanoL The precipitated nucleic acids were 

recovered by ultracentrifiigation in a SW 41 rotor at 38 K, for 60 minutes at 4**C 
or in a microiuge for 10 minutes at 10 K, 4'*C. 
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RNA extracted ftom the liver biopsy was provided by Dr. F. 
Bonin , Ospedale Maggioie di S. Giovanni Battista, Torino, Italy. The 
mononucleoqlB fraction was obtained by sedimentation of the individual's aliquot 
of blood through FicpU-PKiue® (Pharmacia Corp), using the manufacturer's dir- 
5 ections. Total UNA was extracted from the ftaction usmg the guanidiniura tWo- 
cyanate procedure described in Choo et al (1989). 

Syndiesis of HCV cDNA from the san^iles was accomplished using 
reverse transcriptase. Following ethanol precipitation, the precipitated RNA or 
iradeic acid firaction was dried, and resuspended m DEPC treated distilled water. 

10 Secondary structures in the rmcleic acids were disrupted by heating at 65**C for 10 
minutes, and the samples were inunediately cooled on ice. cDNA was synthesized 
usmg 1 to 3 ;«g of total RNA ftom liver, or from nucleic acids (or RNA) extracted 
from 10 to 100 fiL of plasma. The synthesis utilized reverse transcriptase, and 
was in a 25 ^ reaction, using the protocol specified by the manufacturer, BRL. 

15 All reaction mixtures for cDNA synthesis contained 23 units of the RNAase 
inhibitor, Rnasm® (Fisher/Promega). Following cDNA synthesis, the reaction 
mixtures were diluted with water, boiled for 10 minutes, and quickly chilled on 
ice. 

Each set of samples was subjected to two rounds of PGR 
20 amplification. The primers for the reactions were selected to amplify regions 

designated "BivL" and EnvR". The "EnvL" region encompasses nucleotides 669- 
1243, and putative amino adds 117 to 308; the "EnvR" region encompasses 
nucleotides 1215-1629, and encodes putative amino acids 300-408 (tiie putative 
amino acids are numbered starting from the putative methionine irutiation codon). 
25 The PGR reactions were performed ess^tially according to the 

manufacturer's directions (Cetos-Perkin-Ehner), except for the addition of 1 ftg of 
RNase A. The reactions were carried out in a final volume of 100 /iL . The PGR 
was performed for 30 cycles, utilizing a regimen of 94^*0 (1 min), 37''C (2 min), 
and 72°C (3 min), with a 7 minute extension at 72^*0 for flie last cycle. The 
30 samples were then extracted with phenohCHClj, ethanol precipitated two times, 
resuspended in 10 mM Tris HCl, pH 8.0, and concentrated using Centricon-30 
(Anticon) filtration. This procedure dfidentiy removes oligonucleotides less than 
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30 nucleotides in size; thus, the primers from the first round of PCR amplification 
are removed. 

The Centricon-30 concentrated samples were then subjected to a 
second round of PCR amplification. Amplification by PCR was for 35 cycles 
5 utilizing a regimen of 94'*C (1 min), eO'^C (1 min), and 72T (2 min), with a 7 
minute ext^ion at 72''C for the last cycle. The samples were then extracted with 
phenol:CHa3, precipitated two tunes, and digested with EcoPI. The PCR 
reaction products were analyzed by separation of the products by electrophoresis 
on 6% polyaciylamide gels. DNA of qiproximately the estimated size of the 
10 expected PCR product was electroeluted from the gels, and subcloned into either a 
pGEM-4 plasmid vector or into XgtlL Hie expected product sizes for the ^vL 
and EnvR after the first round of amplification are 615 bp and 683 bp, 
respectively; after the second round of amplification the expected product sizes for 
EnvL and EnvR are 414 bp and 575 bp, respectively. The plasmids containing the 

15 amplified products were used to transfonn host cells; the pGEM-4 plasmid was 
used to transfonn DH5-a^ha, and Xgtll was used to transfonn C600 delta-HFL. 
Clones of the transformed cells which either hybridized to the appropriate HCV 
probes, or those which had inserts of the correct size were selected. The inserts 
were then cloned in M13 and sequenced. Hie probes for all of the HCV/cPCR 

20 products consisted of labeled sections of HCV cDNA which had been prepared 
by PCR an^lification. 

Sequence information on variants in the EnvL region was obtained 
from 3 clones from HCT #18, 2 clones ftom TH, 3 clones from ECl, and ftom 
the HCVl clones. A comparison of the composite nucleotide sequence of each 

25 isolate derived from these clones is shown in Fig. 7. In the figure, each sequence 
is shown 5' to 3' for the sense strand for the EnvL region, and the sequences have 
been aligned. The vertical lines and cq^ital letters indicate sequence homology, 
the absence of a line and an unc^italized letter indicates a lack of homology. The 
sequences shown in the lines are as follows: line 1, Thorn; line 2, ECl; line 3, 

30 HCT #18; line 4, HCVl. 

Sequence information on variants in the EnvR region was obtained 
ftom two clones of EClO, and from HCVl clones. The two EClO clones differed 
by only one nucleotide. A comparison of the nucleotide sequences of EC10(clone 
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2)andacoinpositBof theHCVl sequences is shown in Rg. 8; each sequence is 
shown 5' to 3' for the sense strand of the EnvR region, and the sequences have 
been aligned. The double dots between the sequences indicate sequence 
homology. 

5 A comparison of the amino add sequence encoded in the BivL 

(amino acids #117-308) and EnvR region (amino adds #300-438) for each of the 
isolates is shown in Hg. 9 and Fig, 10, iBSpectivdy. Included in the Figures are 
sequences for the isolates 3H23 and JH27, described above. Also indicated are 
sequences from a Japanese isolate; these sequences were provided by Dr. T. 

10 Miyamuia, Japan. In the figures, the amino acid sequence for the region is given 
in its entirety for HCVl , and the non-homologous amino adds m the various 
isolates are mdicated. 

As seen in Fig. 9, In the BivL region there is overall about a 93% 
homology between HCVl and the other isolates. HCT18, Th, and ECl have 

15 about a 9756 homology with HCVl; JH23 and JH27 have about 96% and about 
95% homology, respectively, with HCVl. Fig. 10 shows that the homologies in 
the EnvR region arc significantly less than in the EnvL region; moreover, one 
snbregion qrpears to be hypervaiiable {i.e., ficom amino add 383-405). This data 
is summarized in the Table immediately bdow. 

20 Table: Homology of BivR Region 

Isolate Percent Homology with- HCVl 
AA330-AA438 AA383-AA405 



JH23(U.S.) 83 57 

JH27(U.S.) 80 39 

25 J^)anese 73 48 

EClOgtaly) 84 48 

VI. Industrial Applicabilitv 

The epitopes identified herein can be used to make polypeptide 
30 products as desaibed above for applications such as the screening of blood for 
HCV infection, clinical HCV diagnosis, the generation of antibodies, and 
preparation of medicaments. Other applications are described above, and still 
others will be readily apparent to those of ordinary skill. 
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WWATTR<7T AiTV/ren- 

1. A polypeptide comprising a tnincated HCV sequence containing 
an HCV epitope of the fonnula 
5 a^x-aay 
wherein aa denotes an amino acid; 
X and y aie integers such that y-x ^ 6; 

33,-3% indicates a portion of the amino acid sequence of Figure 1; and 
X is selected from the groq> consistmg of 23-34, 36, 66-79, 81-94, 96- 
10 98, 101-103, 186-189, 191, 206, 223, 232, 256, 286, 297-299, 321, 347, 357, 
413, 414, 432, 465-471, 480-484, 501, 502, 521, 540-549, 579, 594-599, 601- 
613, 641, 662-665, 685, 705, 706, 729, 782-789, 801, 851-855, 893, 916, 928, 
946, 952-954, 1026, 1072, 1109, 1112-1117, 1218, 1240, 1280-1285, 1322, 1338, 
1371, 1384, 1410, 1411, 1454, 1492, 1493, 1532-1535, 1560, 1561, 1566-1568, 
15 1571-1577, 1601-1607, 1615-1620, 1655, 1695, 1710-1712, 1728, 1729, 1758- 
1762, 1781, 1808, 1821, 1851, 1880, 1908-1913, 1925, 1940-1948, 1951, 1966- 
1969. 1999, 2001-2004, 2006-2014, 2024, 2048-2053, 2055-2057, 2071, 2088- 
2093, 2108, 2122-2148, 2165, 2187, 2226-2232, 2244-2249, 2267, 2281-2286, 
2288, 2289, 2325-2327, 2346, 2347, 2349, 2382, 2401, 2417-2422, 2439-2444, 
20 2446-2456, 2469, 2471-2476, 2495, 2533, 2534, 2573-2578, 2602-2604, 2606- 
2612, 2632-2638, 2660, 2676-2679, 2688-2693, 2707, 2721, 2757-2762, 2779, 
2794, 2795, 2797-2799, 2801, 2802, 2817-2843, 2863-2867, 2878-2884, 2886- 
2895. 

25 2. The polypqptide of claim 1 which is about 100 amino acids or less 

in length. 

3. The polypeptide of claim 2 wherein y-x ^ 50. 
30 4. The polypq}tide of claim 2 wherein y-x ^ 20. 



5. The polypeptide of claim 2 wherein y-x < 10. 
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6. The polypqrtide of claim 2 wherein x is selected from the groop 
consisting of 506, 510. 523, 553, 562, 580, and 590-620. 

7. A polypq)tide of about 100 amino adds or less comprising an 
5 HCV epitape of the formula 

aax-aa^ 

wherein aa denotes an amino add; 
X and y are integers such that y-x ^ 6; 

aa^-aay indicates a portion of the amino add sequence of Hgure 1; and 
10 X is sdected from the group conasting of 35 (where y is less than 45), 

80 (where y is less than 90), 95 (where y is less than 110), 99 (where y is less 
than 120), 100 (where y is less than 150), 190 (where y is less than 210), 500 
(where y is less than 550), 600 (where y is less than 625). 1260 (where y is less 
than 1280), 1569 (where y is less than 1931), 1570 (where y is less than 1590), 
15 1694 (whrae y is less than 1735), 1949 (where y is less than 2124), 1950 (where y 
is less flian 1985), 2000 (where y is less than 2050), 2005 (where y is less than 
2025), 2054 (where y is less than 2223), 2250 (where y is less than 2330), 2287 
(where y is less than 2385), 2290 (where y is less than 2310), 2345 (where y is 
less than 2375), 2348 (where y is less than 2464), 2445 (where y is less than 
20 2475), 2470 (where y is less than 2490), 2605 (where y is less than 2620), 2780 
(where y is less than 2830), 2796 (where y is less than 2886), 2800 (where y is 
less tiian 2850). and 2885 (where y is less than 2905). 

8. An immunoassay reagent comprising a polypeptide according to 
25 any of claims 1-7. 

9. The immunoassay reagent of claim 8 wherein y-x ^ 50. 



30 



10. The immunoassay reagent of claim 8 wherein x is selected from 
the group consisting of 506, 510, 523, 553, 562, 580, and 590-620. 
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IL A m^od for detecting the piesmce of antibodies immunoieactive 
with Hqsatitis C virus (HCV) proteins in a sample, said method comprising: 

contacting an immobilized inmmnoassay leagmt according to claim 8 
with said sample; and 
5 detecting antibodies bomid to said reagent. 

12. A method for inducing an immunological response in a subject 
against HCV, said method comprising: 

10 administering to said subject an effective amount of a polypq^tide 

accoding to any one of claims 1-7. 

13. A composition for inducing an immunological response in a 
subject against HCV, said composition comprising an effective amount of a 

15 polypqrtide accoding to any one of claims 1-7. 

14. A monoclonal or polyclonal antibody composition wherein said 
antibodies bind the HCV epitope of a polypeptide according to claim 1. 

20 15. A method of making a polypeptide accordiag to aiiy of claims 1-7 

wherein said polypeptide is prqiared by recombinant e^ipression or chemical 
synthesis. 
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FIG, I - 1 



R T 

MSTNPKPQKKNKRNTNRRP0DVKFPGGGQIVGGVYLLPRRGPRIX3VRATR 
KTSERSQPRGRRQPIPKARRPEGRTWAQPGYPWPLYGNEGCGWAGWLLSP-100 
RGS R P S WG PTDP RRRS RNLGKVIDTLTCGFADLMG YI PLVG APLG G AARA 

T 

LAHGVRVLEDGVNYATGNLPGCSFSIFLLAIjLSCLTVPASAYQVRNSTGL-200 
YHVTNDCPNSSIYYEAADAILHTPGCVPCVREGNASRCWVAMTPTVATRD 
GKLPATQLRRHIDLLVGSATLCSALYVGDLCGSVFLVGQLFTFSPRRHWT-3 0 0 

V 

TQGCNCSIYPGHITGHRMAWDMMMNWSPTTALVMAQLLRIPQAILDMIAG 
AHWGVIAG IAYFSMVGNWAKVLVVIiLLFAGVDAETHVTGGSAGHTVSGFV-4 0 0 
SLLAPGAKQNVQLINTNGSWHLNSTALNCNDSLNTGWLAGLFYHHKFNSS 
GCPERLASCRPLTDPDQGWGP ISYANGSGPDQRPYCWHYPPKPCG IVP AK-50 0 
SVCGPVYCFTPSPVWGTTDRSGAPTYSWGENDTDVFVLNNTRPPLGNWF 
GCTWMNSTGFTKVCGAPPC7IGGAGNNTLHCPTDCFRKHPDATYSRCGSG-600 

I 

PWLTPRCLVDYPYRLWHYPCTINYTIFKIRMYVGGVEHRLEAACNWTRGE 
RCDLEDRDRSELS PLLLTTTQWQVLPCSFTTLPALSTGLIHLHQN IVDVQ-7 0 0 
YLYGVGSSIASWAIKWEYVVLLFLLLADARVCSCLWMMLLISQAEAALEN 
LVILNAASLAGTHGLVSFLVTFCFAWYLKGKWVPGAVYTFYGMWPLLLLL- 80 0 
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LALPQRAYALDTBVAASCGGWLVGLMALTLSPYYKRYISWCLWWLQYFL 
TRVEAQLHVWI P PLNVRGGRDAVILLMCAVHPTLVFDITKLLLAVFGPLW- 90 0 
ILQASLLKVPYFVRVQGLLRFCALARKMIGGHYVQMVIIKLGALTGTYVY 
NHLTPLRDWAHNGLRDLAVAVEPWFSQMETKLITWGADTAACGDIINGL-1000 
PVSARRGREILLGPADGMVSKGWRLLAPITAYAQQTRGLLGCIITSLTGR 
DKNQVEG EVQ I VS TAAQTFLATCINGVC WTVYHG AGTRTIAS PKG P VIQM- 1100 

S T 

YThTVDQDLVGWPAPOGSRSLTPCTCGSSDLYLVTRHADVIPVRRRGDSRG 

SLLSPRPISYLKGSSGGPLLCPAGHAVGIFRAAVCTRGVAKAVDFIPVEN-1200 

LETTMRSPVFTDNSSPPWPQSFQVAHLHAPTGSGKSTKVPAAYAAQGYK 

L 

VLVLNPSVAATLGFGAYMSKAHGIDPNIRTGVRTITTGSPITYSTYGKFL-1300 



ADGGCSGGAYDI I ICDECHSTDATS ILG IGTVLDQAETAGARLWLATAT 

PP6SVTVPHPNIEEVALSTTGEIPFYGKAIPLEVIKGGRHLIFCHSKKKC-1400 

DELAAKLVALGINAVAYYRGLDVSVIPTSGDWWATDALMTGYTGDFDS 

V (S) 
VIDCNTCVTQTVDFSLDPTFTIETITLPQDAVSRTQRRGRTGRGKPGIYR-1500 
FVAPGERPSGMFDSSYLCECYDAGCAWYELTPAETTVRLRAYMNTPGLPV 
CQDHLEFWEGVFTGLTHIDAHFLSQTKQSGENLPYLVAYQATVCARAQAP- 1600 
PPSWDQMWKCLIRLKPTLHGPTPLLYRLGAVQNEITLTHPVTKYIMTCMS 
ADLEVVTSTWVLVGGVLAALAAYCLSTGCVVIVGRVVLSGKPAIIPDREV-1700 
LYREFDEMEECSQHLPYIEQGMMLAEQFKQKALGLLQTASRQAEVIAPAV 
OTNWQKLETFWAKHMWNFISG IQYLAGLSTLPGNPAIASimFTAAVTSP-18 GO 
LTTSQTLLFNILGGWVAAQLAAPGAATAFVGAGLAGAAIGSVGLGKVLID 

(G) 

I LAG YG AGVAG ALVAFK IMSG EVPSTEDLVNLLP AILS PGALWG WCAA- 1900 

(HC) 

ILRRHVGPGEGAVQW^WRL•IA?ASRGNHVSPTHYVPESDAAARVTAILSS 



FIG. I -2 
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LTVTOLLRRLHQW ISS ECTTPCSGSWLRD IWDWICEVLSDFKTWLKAKLM- 20 0 0 

(V) 

POLPGIPFVSCQRGYKGVWRGDGIMHTRCHCGAEITGHVKNGTMRIVGPR 

TCRNMWSGTFPINAYTTGPCTPLPAPNyTFALWRVSAEEYVEIRQVGDFH-2100 

YVTGMTTDNLKCPCQVPSPEFFTELDGVRLHRFAPPCKPLLREEVSFRVG 

LHEYPVGSQLPCEPEPDVAVLTSMLTDPSHITAEAAGRRLARGSPPSVAS-2200 

SSASOLSAPSLKATCTANHDSPDAELIEANLLWRQEMGGNITRVESENKV 

VILDSFDPLVAEEDEREISVPAEILRKSRRFAQALPVWARPDYNPPLVET-2300 

(S) 

WKKPDYEPPWHGCPLPPPKSPPVPPPRKKRTWLTESTLSTALAELATR 

(FA) 

SFGSSSTSGITGDNTTTSSEPAPSGCPPDSDAESYSSMPPLEGEPGDPDL-24 00 

SDGSWSTVSSEANAEDWCCSMSYSWTGALVTPCAAEEQKLPINALSNSL 

LRH HNLVYSTTSRSACQRQKKVTFDRI^VLDSHYQDVLKEVKAAASKVKA- 2500 

(F) 

NLLSVEEACSLTPPHSAKSKFGYGAKDVRCHARKAVTHINSVWKDLLEDN 

VTP IDTTIMAKNEVFCVQPEKGGRKPARLIVFPDLGVRVCEKMALYDWT-26 00 

KLPLAVMGSSYGFQYSPGQRVEFLVQAWKSKKTPMGFSYDTRCFDSTVTE 



SDIRTEEAIYQCCDLDPOARVAIKSLTERLYVGGPLTNSRGENCGYRRCR-2700 

ASGVLTTSCGNTLTCYIKARAACRAAGLQDCTMLVCGDDLWICESAGVQ 

EDAASLRAFTEAMTRYSAPPGDPPQPEYDLELITSCSSNVSVAHDGAGKR-2800 

VYYLTRDPTTPLARAAWETARHTPVKSWLGNIIMFAPTLWARMILMTHFF 

SVLIARDOLEQALDCEIYGACYSIEPLDLPPIIQRLHGLSAFSLHSYSPG-2900 

G 

EINRVAACLRKI/3VPPLRAWRHRARSVRARLLARGGRAAICGKYLFNWAV 



RTKLKLTPIAAAGQLDLSGWFTAGYSGGDIYHSVSHARPRWIWFCLLLLA-3000 
AGVG I YLLPNRO- 30 11 

I 

Stop codon 



( ) - Heterogeneity due possibly 

to 5' or 3' terminal cloning 
artefact. 

FIG. I -3 
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10 20 30 40 

EClO GAATTCGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATAT 

HCVl CTCTCCCAGGCGCCACTGGACGACGCAAGGTTGCAATTGCTCTATC^ 

550 560 570 580 590 600 



50 60 70 80 A 90 100 

AACAGGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTTAGT 
::: :::::;::::::::::::::::::::::::;;::::::::;::•:::;::::: ;: 

AACGGGTCACCGCATGGCATGGGATATGAIGAOXsAACTGGTCCCCTACGACGGCGTTGGT 
610 620 630 640 650 660 

110 120 130 140 150 160 

GGTAGCTCAGCTGCTCCGGATCCCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTC 
: :::::::::::::::::::::::::::::::;::::::::::;:;::::::;:::: 

AATGGCTCAGCTGCTCCGGATCCCACAAGCCATCTTGGACATGATCGCTGGTCCTCACTG 
670 680 690 700 710 720 

170 180 190 200 210 220 

GGGAGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCTTGGC 

GGGAGTCCTGGCGGGCATAGCGTATTTCTCCATGGT6GGGAACTGGGCGAAGGTCCTGGT 
730 740 750 760 770 780 

230 240 250 260 270 280 

AGTGCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACTGGGGGGATCGCCGC 

: : . . . . .... 

AGTGCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACCGGGGGAAGTGCCGG 
790 800 810 820 830 840 

290 300 310 320 330 340 

CAAAACTACGGCTAGCCTTACTGGTCTCTTCAATTTAGGTCCCAAGCAGAACATCCAGCT 
'• • ' '• • '• ' ' •'' i'-i :::::::::::: ::::::: 

CCACACTGTGTCTGGATTTGTTAGCCTCCTCGCACCAGGCGCCAAGCAGAACGTCCAGCT 
850 860 870 880 890 900 

350 360 370 380 390 400 

GATCAACACCAACGGCAGTTGGCACATCAACAGGACGGCCTTGAACTGCAATGATAGCCT 
ll^l::lllL''l''' ''''' 

GATCAACACCAACGGCAGTTGGCACCTCAATAGCACGGCCCTGAACTGCAATGATAGCCT 
910 920 930 940 950 960 

410 420 

CAACACCGGCTGGAATTC 

CAACACCGGCTGGTTGGCAGGGCTTTTCTATCACCACAAGTTCAACTCTTCAGGCTGTCC 
970 980 990 1000 1010 1020 

FIG,8 
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A A #117-308 fpufeitivft ftnvftloDB rpninn) FIG.9 

1 ) HCT #1 8 (USA) 3 dones sequenced 

2) JH23 (USA) ? 

3) JH 27 (USA) ? 

4) PBL-Th (USA) 2 dones sequenced 

5) EC1 (Italy) 3 clones sequenced 

6) HCV-1 (chimpanzee) multiple 

C/M4-j->S 

1) (tt 

2) 

3) 

4) 
5) 

QRNLGKVIDTLTOSFADLMGYIPLVGAPLGGiAARALAHGVRVLEDGVNYATGNL 

1) H 

2) 

3) S T T 

4) L 

5) (F) s 

6) PGCSreiFlXALLSCLTVPASAYQVRNSTGLYHVTNDCPNSS[\A'EAADAILH 

1) (H) V V T 

2) a D V V K T 

PVA N 

A R T 



3) S 

4) A 

5) H 



* T 
ejTPGCVPCVREGNASRCWVAMTPTVATRDGKLPATQLRRHIDLLVGSATT.CS 

1) 

2) I D 

3} D 
4) 

5) I 

6) ALYVGDLC6SVFT.VGQLFTFSPRRHWTTQGCNCSI 
SUMMARY: "S" AA117-308 (93Z) 

HCTi»l8, PBL-Th. ECl (Italy) have 97% homology with HCV-1 

JH23 and JH 27 have 96Z and 95X homology with HCV-I, respectively 
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AA#3Q0.AaB ( Ctarminal ra^inn of th» niitatiuft ftnvalnnft ra^inn anri 
amino -.1/3 of NSH 



1) JH23 

2) JH27 

3) Japanese isolate (T. Miyamura) 

4) EC10 (Italy) 



5) HCV-1 (chimpanzee) 



A 

A 



1) D 

2) D 

3) VS 

4) 

5)TTCXaCNCSiYPGHrrGHRMAV\roMMMNWSPTTALVMACXJJVPC^ 



? 
? 
? 

2 Clones sequenced 

(one nt difference, which did not 

result in an amino acid change) 

multiple 

»NSI 



VM V 



1) M R 

2) T VT 

3) L Y I M QH R 

4) A I AK TASLTA 

5) HWGVLAGIAYFSMVGNWAKVLVVLJ±FAGVbAETHVTGGSAGHTVSGFVSL 



ARSTA VA 
N A R TQALT F 
VQ VT TLT 



i 




T 


V 


1 


R 


A 0 




1 


R 


Q 


F 


1 


R 




N 



1 )FS R I 

2) FT 01 

3) FR S Kl V 

4) FNL I 

5) I^PGAKQrn/QUhn-N6SVVHU^TALNCh©SLNTGWL 

SUMMARY: NS 1 AA 330-660 

"Isolate" ZHomology (AA330-438) ZHomology (AA 383-405) 



JH23 83 

JH27 80 

Japanese 73 

EClO (Italy) 84 



57 
39 
48 
48 
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